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Abstract 

In  recent  years,  methods  of  incorporating  fault-tolerance  in  computer  architectures 
have  shifted  from  the  exclusive  focus  on  traditional  modular  redundancy  techniques 
to  include  more  efficient,  analytical  means  of  achieving  the  same  result.  Song  and 
Musicus  (1990)  describe  such  a  method  of  analytic  fault-tolerance  by  using  a  statis¬ 
tical  test  and  minimal  redundancy  to  protect  linear  operations.  Aliphas,  Wei,  and 
Musicus  (1991)  in  turn  developed  a  hardware  prototype  to  test  this  method  of  sta¬ 
tistical  fault-tolerance.  Using  this  hardware  prototype,  a  multidirectional,  digital 
interpolation  sonar  beamformer  was  implemented.  Whereeis  the  original  prototype 
used  multiple  processors  performing  the  same  task  on  different  input  data,  this  im¬ 
plementation  uses  a  different  architecture  whereby  the  same  data  is  sent  to  multiple 
processors  performing  different  tasks.  The  digital  beamformer  relaxes  the  sampling 
requirement  typically  required  by  sampling  the  incoming  waveform  at  the  Nyquist 
rate  and  upsampling  to  form  the  beam.  Implementing  the  beamformer  on  the  cur¬ 
rent  architecture  offered  double  fault  detection  and  single  fault  correction  capabilities 
across  a  range  of  look  angles.  In  conjunction  with  previous  results,  the  results  of  this 
thesis  verify  that  the  same  fault  detection  and  correction  algorithm  may  be  used  for 
architectures  both  with  processors  performing  different  operations  on  the  same  data, 
and  with  processors  performing  the  same  operation  on  different  data. 
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Chapter  1 


Introduction 


In  a  conventional  N-modular  redundant  computer  architecture,  N  copies  of  processors, 
memory,  and  I/O  units  are  driven  with  the  same  program  and  the  same  data.  The 
outputs  of  these  N  units  are  then  compared  to  verify  that  they  are  operating  correctly. 
Although  it  is  possible  for  these  particular  schemes  to  reliably  detect  or  even  correct 
up  to  {N  —  l)/2  errors,  a  large  amount  of  additional  hardware  must  be  dedicated 
to  monitoring  any  potential  faults,  and  the  synchronization  required  between  the 
processors  is  very  demanding.  Traditional  triple-modular  redundancy,  then,  is  capable 
of  single  error  correction,  but  at  the  expense  of  at  leeist  200%  overhead. 

One  method  of  incorporating  fault-tolerance  relies  upon  low  redundancy,  arith¬ 
metic  coding  to  protect  linear  computations  against  failure  [2,  9].  This  approach  uses 
a  generalized  likelihood  ratio  test  (GLRT)  in  order  to  statistically  produce  the  best 
possible  fault  decisions  in  applications  where  the  calculations  are  subject  to  trunca¬ 
tion  or  rounding  errors.  The  architecture  utilizes  a  parallel  bank  of  signal  processors 
performing  the  same  linear  function,  where  each  processor  operates  on  its  own  unique 
input.  This  method  of  fault-tolerance  was  tested  with  a  hardware  prototype  using 
commercial  processors,  and  successfully  demonstrated  [2]. 

A  new  approach  to  incorporating  such  analytic  fault-tolerance  has  been  taken 
that  creates  the  opportunity  for  an  entirely  new  class  of  applications.  By  effectively 
transposing  the  previous  architecture,  a  design  is  achieved  wherein  a  common  input 
is  broadcast  to  a  bank  of  signal  processors  each  performing  its  own  unique  function. 
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Since  one  of  the  beneficial  properties  of  linear  operations  is  commutativity,  it  will  be 
seen  that  the  same  generalized  likelihood  ratio  test  will  apply  to  the  error  detection 
and  correction  of  this  new  architecture.  In  addition,  the  opportunities  for  combin¬ 
ing  the  two  architectures  exist,  and  open  the  door  to  a  host  of  signal  processing 
applications. 

To  demonstrate  the  principles  of  this  signal  processing  architecture,  a  multidirec¬ 
tional  sonar  beamformer  was  chosen.  Real  sonar  data  was  collected,  and  tested  using 
the  beamformer  programmed  with  existing  hardware.  Because  of  the  low  overhead 
involved  in  arithmetic  redundancy,  such  a  sonar  system  is  ideally  suited  for  small, 
unmanned  applications  where  both  power  and  size  are  constrained.  This  application 
is  only  one,  however,  and  certainly  several  others  exist  that  could  effectively  utilize 
such  analytic  fault-tolerance. 

This  thesis  is  broken  into  four  remaining  chapters.  Chapter  2  reviews  the  deriva¬ 
tion  of  analytic  fault-tolerance  for  a  single  function  architecture  with  multiple  inputs. 
Chapter  3  begins  the  work  done  for  this  thesis  by  extending  the  derivations  of  Chapter 
2,  and  arriving  at  the  new  signal  processing  architecture  with  a  single  input  and  mul¬ 
tiple  functions.  Chapter  4  discusses  the  aspects  of  digital  interpolation  beamforming, 
and  the  implementation  of  a  digital  interpolation  beamformer  onto  this  single  input, 
multi-function  architecture.  Chapter  5  offers  conclusions,  and  recommendations  for 
future  work. 
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Chapter  2 


Multi-Input  Single  Function 
(MISF)  Fault-Tolerance 


2.1  Aspects  of  MISF  fault-tolerance 

To  meet  the  demands  of  many  computationally  intensive  signal  processing  applica¬ 
tions,  massively  parallel  multiprocessor  architectures  are  often  used  [8].  Some  appli¬ 
cations  may  require  processors  operating  in  parallel,  and  performing  the  same  task. 
For  these  types  of  applications,  it  is  desirable  to  send  different  data  to  the  different 
processors  in  order  to  increase  the  total  processing  throughput.  If  N  processors  are 
used  in  such  an  architecture,  the  total  processing  throughput  is  thus  approximately 
N  times  that  of  a  single  processor. 

To  protect  these  N  processors  from  failure  using  triple  modular  redundancy  would 
require  an  additional  2N  processors,  as  well  as  multiple  voters  to  compare  the  outputs 
of  all  processors.  Figure  2.1  shows  a  triplicately  modular  redundant  system  for  a  single 
processor.  Each  voter  declares  a  processor’s  output  faulty  if  it  fails  to  agree  with 
the  majority  of  all  processor  outputs.  Majority  here  means  two,  so  triple  modular 
redundancy  is  capable  of  isolating  a  single  error.  In  theory,  a  single  voter  can  isolate 
an  error  from  such  a  triplicated  processor  scheme.  Additional  voters  are  needed, 
however,  in  the  event  that  one  fails.  How  the  output  of  these  triplicated  voters  are 
then  used  depends  upon  the  particular  application.  An  alternative  to  this  method  of 
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Figure  2.1;  Triple  Modular  Redundancy  for  a  single  processor 


fault-tolerance  is  to  incorporate  the  redundancy  analytically. 

2.1.1  Adding  redundancy 

Figure  2.2  shows  how  redundancy  is  incorporated  arithmetically  with  minimal  over¬ 
head. 


Figure  2.2:  MISF  fault-tolerant  architecture 

In  a  manner  analogous  to  adding  parity  bits  to  a  data  word,  C  “checksum”  pro¬ 
cessors  are  added  to  protect  N  “working”  processors.  Let  Xfc(m)  represent  a  p  length 
data  vector,  or  packet,  sent  to  processor  k  at  time  m,  where  1  <  A:  <  A^.  Each  of  the 
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C  checksum  processors  then  computes  a  different  weighted  linear  combination  of  the 
packets  at  time  m,  forming  new  packets  [2]: 

N 

Xj{m)  =  ^Wj,kXk{m)  ioT  j  =  N  +  1,. . .  ,N  +  C  (2.1) 

k=l 

where  the  Wj^k  are  scalar  weights.  The  same  linear  function  F  is  then  applied  to  the 
inputs  of  all  iV  +  C  processors,  producing  r  length  output  packets 

^(m)  =  Fijt(m)  (oT  k  =  1,. . . ,  N  +  C  (2.2) 

Thus  F  is  an  r  X  p  matrix.  The  output  error  signals  can  then  be  computed  by 
subtracting  the  same  weighted  linear  combinations  of  the  outputs  from  the  N  working 
processors,  or  channels,  from  the  outputs  of  the  checksum  channels: 

N 

sj(m)  =  y.{m) {or  j  -  N  +  1, . . . ,  N  +  C  (2.3) 

fc=i 

For  the  ease  of  discussion,  the  time  index  m  will  be  omitted  from  here  on.  Since  these 
error  signals  may  be  used  for  fault  diagnosis,  they  will  be  referred  to  as  syndromes. 
If  all  processors  have  computed  their  results  properly  the  syndromes  should  be  zero, 
since 

N  N 

F  n  ^hk^^k  for  J  =  AT  -f  1, . . . ,  AT  4-  C 

*=1  k=i 

In  actuality,  the  processors  do  not  have  infinite  precision,  and  thus  truncation  or 
rounding  errors  can  be  expected.  These  errors  are  typically  small,  and  are  uniformly 
distributed  over  the  range  of  the  smallest  quantization  level  used  by  the  processors. 
Furthermore,  these  errors  due  to  limited  precision  occur  randomly,  and  may  be  as¬ 
sumed  to  be  independent.  By  the  central  limit  theorem  then,  in  the  absence  of  a 
processor  failure,  the  syndromes,  which  consist  of  a  sum  of  uniformly  distributed 
random  variables,  may  be  approximated  as  low  intensity  white  Gaussian  noise. 

To  calculate  the  syndromes  in  the  event  of  a  processor  failure,  assume  that  the 
output  from  processor  k  differs  from  its  correct  value  by  the  composite  error  [9]. 
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In  this  event,  equation  (2.3)  becomes: 


N 

i.j  =  ^  -  —  ^ ^o*se  for  j  =  N  +  l,...,N  +  C 

^  Jt=i 


(2.4) 


This  equation  can  be  simplified  by  defining  the  weights  in  the  checksum  processors 
to  be: 


I  —1  for  t  =  j  =  1,. . .  jC 

WN+i,N+j  =  S 

I  0  for  t  ^  j 


(2.5) 


so  that 


N+C 

Sj  =  -  51  ^j.k^k  for  j  =  iV  +  1, . . . ,  +  C 

fc=i 


(2.6) 


It  is  assumed  that  any  hardware  failures  occur  independently  in  the  N  +  C  pro¬ 
cessors,  so  that  at  most  one  working  or  one  checksum  processor  fails  at  a  given  time. 
Suppose  then  that  only  the  fcth  processor  fails,  so  that  only  is  nonzero.  Given  this, 
the  syndromes  will  have  values: 


s  = 


/  ^ 

(  T  \ 

iN+l 

rfAr+i.fcl 

^  ^N+C  ) 

^N+C,kl  j 

+  noise 


(2.7) 


where 

/  T 

'WN+lM 

is  called  the  A:th  weight  vector,  and  will  be  referred  to  as  "Wk. 

Note  that  a  checksum  channel  failure  can  be  distinguished  from  a  working  channel 
failure  if  at  least  two  of  the  weights  in  the  set  {ws+i,kt  •  •  •  >  ^N+c,k}  are  nonzero  for 
all  1  <  A:  <  iV  [2].  Thus,  a  checksum  channel  failure  results  in  only  one  non-zero 
syndrome,  whereas  a  working  channel  failure  will  result  in  at  least  two  non-zero 
syndromes. 


14 


2.1.2  Selecting  the  weights 

A  block  weight  matrix  can  be  defined,  composed  of  the  N  +  C  weight  vectors: 

ttfAf+i.iI  •••  — I  •••  0 

•  ,  •  •  •  • 

•  »  •  •  •  • 

•  •  •  •  •  • 

tww+c.il  •  •  •  “'at+c.nI  0  ...  —I 

where  each  block  is  r  x  r.  Naturally,  the  weights  should  be  chosen  so  as  to  reduce  the 
difficulty  of  computing  the  checksum  inputs  and  syndromes.  Small  integers  (0,  ±1, . . .) 
make  particularly  good  choices,  since  no  multiplication  is  needed  then.  A  desirable 
property  is  that  the  columns  of  W  form  a  “complete  set”,  wherein  if  (oi  02- ••Qc)^ 
is  a  weight  vector,  then  every  vector  of  the  form  (±q;i  ±  02  ■  •  •  ±  Qfc)^  is  also  either 
a  weight  vector  or  the  negative  of  a  weight  vector  [9].  This  causes  the  weight  vectors 
to  exhibit  a  useful  symmetry. 

There  are  various  selection  criteria  for  choosing  the  scalar  weights  Wj^k,  but  they 
should  be  chosen  so  that  it  is  possible  to  distinguish  between  failures  in  the  different 
working  processors.  In  case  of  failure  on  the  kih.  channel,  equation  (2.7)  implies  that 
the  syndromes  should  lie  on  a  line  defined  by  the  kth  weight  vector,  W*.  If  there  was 
no  rounding  or  truncation  noise  present,  the  syndromes  would  fall  perfectly  on  that 
line.  Due  to  errors  of  finite  precision,  however,  they  would  not  fall  directly  on  the 
line,  but  should  fall  close  by.  In  this  Ccise,  the  processor  closest  in  signal  space  to  the 
syndromes  would  be  declared  faulty.  Hence  it  is  desirable  to  maximize  the  angular 
separation  between  weight  vectors,  in  order  to  reduce  the  chance  of  a  misdiagnosis. 
If  the  inner  product  between  any  two  weight  vectors  is  defined  as: 

N+c 

^k,m  =  (2*9) 

j=N+l 

then  the  angle  $k,m  between  any  two  weight  vectors  k  and  m  may  be  defined  in  the 
usual  manner: 
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(2.10) 


To  illustrate  this  method  of  geometric  projection  fault  diagnosis,  a  simple  two  dimen¬ 
sional  weight  matrix  may  be  formed: 


(2.11) 


where 

C.  =  cos(^)  S„  =  sii.(y) 

Figure  2.3  shows  the  4  columns  of  (2.11)  assuming  a  block  size  oi  q  =  1,  where  the 
arrows  indicate  the  weight  vectors  in  two  dimensional  space.  The  angular  separation 


Figure  2.3:  Weight  vectors  in  2D  syndrome  space  with  fault  decision  regions. 


between  weight  vectors  here  is  45®.  Syndromes  falling  far  from  the  origin  naturally 
have  higher  energies,  and  probably  correspond  to  a  processor  failure.  Syndromes 
falling  near  the  origin  have  smaller  energies,  and  probably  correspond  to  mere  round¬ 
ing  noise.  Letting  Hjt  be  the  hypothesis  that  the  fcth  processor  has  failed,  figure  2.3 
shows  the  fault  decision  regions  for  the  four  different  processors.  The  angular  sep¬ 
aration  of  the  weight  vectors  may  be  further  increzised  by  using  complex  weights, 
but  at  the  cost  of  increased  complexity.  Furthermore,  adding  checksum  processors 
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increases  the  syndrome  space  thereby  allowing  greater  separation  ais  well,  but  at  a 
cost  of  increased  redundancy. 


2.1.3  Fault  detection  and  correction 

If  only  one  checksum  processor  was  used  in  figure  2.2,  then  only  one  syndrome  would 
be  formed.  In  this  case,  a  single  error  could  be  detected,  since  that  syndrome  would  be 
zero  except  in  the  event  a  processor  has  failed.  However,  it  is  possible  to  distinguish 
between  a  failure  on  processor  ki  and  processor  k2  only  if  weight  vectors  Wi  and 
Wa  are  linearly  independent.  This  requires  a  minimum  of  two  checksum  processors. 
Hence  it  is  possible  to  detect  and  correct  a  single  error  with  C  =  2. 

In  general,  in  order  to  detect  L  failures,  every  possible  set  of  L  weight  vectors 
must  be  linearly  independent,  for  otherwise  it  would  be  possible  for  a  combination 
of  L  failures  to  produce  all-zero  syndromes.  This  requires  L  dimensional  weight 
vectors,  and  therefore  L  checksum  processors.  In  order  to  detect  and  correct  up 
to  K  failures,  the  hyperplane  defined  by  all  possible  linear  combinations  of  a  given 
set  of  K  weight  vectors  cannot  intersect  with  the  hyperplane  defined  by  all  possible 
linear  combinations  of  any  other  set  of  K  weight  vectors,  except  at  the  origin  [9]. 
In  figure  2.3,  if  the  hyperplanes  are  taken  to  be  the  lines  represented  by  the  weight 
vectors,  it  is  clear  to  see  that  these  lines  intersect  only  at  the  origin.  So  here,  K  — 
which  is  the  dimension  of  a  line.  Detection  and  correction  of  up  to  K  faults  thus 
requires  a  2K  syndrome  space,  or  equivalently,  2K  checksum  processors. 

Therefore,  given  C  =  2K  +  L  additional  checksum  processors,  up  to  K  +  L 
simultaneous  failures  may  be  detected,  and,  if  no  more  than  K  have  occured,  can  be 
corrected,  provided  that  every  set  of  2K  -f  L  weight  vectors  are  linearly  independent. 
Note  that  the  fault  detection  and  correction  capacity  is  independent  of  the  number 
of  working  processors.  The  work  done  so  far  has  concentrated  on  a  particular  choice 
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of  W,  where  =  10  and  C  =  3: 


W  = 


I  I  I  0 
I  -I  -I  I 
-I  I  -I  I 


01  II  I  -I  0  0^ 

10  0  I  -I  0  -I  0 

-I  I  -I  0  0  0  0  -I  ^ 


(2.12) 


and  therefore  this  configuration  is  capable  of  double  fault  detection,  and  single  fault 
correction. 


2.2  Maximum  likelihood  estimation 

A  reasonable  approach  for  diagnosing  the  failure  is  to  measure  the  total  syndrome 
energy  and  compare  it  with  a  fixed  threshold,  7  [8]. 

If  it  is  above  the  threshold,  i.e.  if 

N+C 

i=A^+i 

then  a  failure  has  probably  occurred.  To  diagnose  the  failure,  hypothesize  that  pro¬ 
cessor  k  htis  failed,  with  fault  value  If  this  is  the  c«ise,  then  each  syndrome  Sj 
should  have  the  value  according  to  equation  (2.7).  The  energy  Ek  in  the 

difference  between  the  observed  syndromes  and  these  expected  values  would  then  be: 

N+C 

£  lUj  +  (2-13) 

j=Ar+i 

If  processor  k  has  actually  failed,  and  <(>  is  the  value  of  the  actual  failure,  then  Ek 

—  K 

will  be  on  the  order  of  the  round-off  noise.  Otherwise,  Ek  will  be  large.  Therefore, 
for  all  A/^  -f  C  processors,  estimate  the  failure  which  best  explains  the  observed 
syndromes  in  terms  of  a  failure  on  processor  k  in  &  least  squares  sense,  aissuming  all 
processor  failures  are  equally  likely: 

^  nunEk{<f>^)  (2.14) 

±k 
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Then  the  processor  k  most  likely  to  have  failed  is  the  one  that  yields  the  smallest 
difference  in  energy  between  the  observed  syndromes  and  their  estimated  values: 

fc  minf?/t(^.)  (2.15) 

Correcting  the  output  from  processor  k  using  the  estimated  fault  value  gives 

ki^yk-h  (216) 

Figure  2.4  shows  this  procedure  graphically  for  an  assumed  batch  size  of  q  =  1 .  The 
vector  is  the  projection  of  the  syndromes  s  onto  the  Arth  weight  vector  line, 

and  the  squared  distance  from  s  to  the  weight  vector  line  is 

iN+2 


Figure  2.4:  Graphical  interpretation  of  MLE  algorithm 

This  procedure  can  be  shown  [2,  9]  to  be  equivalent  to  the  following  computation¬ 
ally  efficient  algorithm: 

Compute  the  cross-correlations  pj^k  between  the  syndromes 
pj,k  =  SkSj  for  j,k  =  N +  +  C 
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Compare  syndrome  energy  with  threshold 


N+C 

If  ^2  Pk,k  ^  7»  then  FAILURE 

it=JV+l 


To  correct  the  failure,  compute  relative  likelihoods 


rk,k 


-W" 


P/v+i,Ar+i 


^  pN+C,N+l 


Pn+i,n+c 


Pn+c,n+c 


Wit 


N+C 

where  rk,k  =  $3 
i=N+i 


Choose  the  processor  k  with  the  largest  Z,]^,  (call  it  k). 
Correct  the  fault  : 


I  N+c 

E  ^Ik^i 


k.k  i=/tf+l 


(2.17) 


where  7  is  a  constant  threshold  which  can  be  set  to  achieve  a  desired  probability 
of  false  alarm,  and  the  L'k  are  proportional  to  the  negative  of  the  errors  Ek  plus  a 
constant.  The  reason  for  calling  them  relative  likelihoods  is  explained  below. 

It  is  noteworthy  to  point  out  that  this  least  squares  algorithm  can  be  shown  [9]  to 
be  optimal  under  some  relatively  simple  conditions.  It  is  already  known  that  errors 
due  to  finite  precision  arithmetic  have  been  modeled  as  white  Gaussian  noise,  with 
statistics  that  may  be  precomputed  based  upon  the  linear  function  F,  and  the  weights 
Wj^k‘  If,  additionally 


1.  Rounding  noise  on  each  sample  is  uncorrelated  with  the  signal  and  with  round¬ 
ing  noise  on  any  other  sample. 


2.  There  is  either  no  rounding  when  computing  the  input  checksums,  or  else  the 
linear  function  is  an  orthogonal  transform. 

3.  The  weight  matrix  forms  a  complete  set. 


then  under  these  conditions,  samples  of  the  syndromes  are  all  independent  and  iden- 


tically  distributed  [2].  It  will  be  seen  that  the  beamformer  implemented  with  this 
analytic  fault  tolerance  satisfies  these  conditions.  Additionally,  this  least  squares  al¬ 
gorithm  is  virtually  identical  to  the  statistically  optimal  Generalized  Likelihood  Ratio 
Test  (GLRT),  which  becomes  a  maximum  likelihood  criterion  under  the  assumption 
that  all  processors  may  fail  with  equal  probability.  The  are  relative  because  they 
are  proportional  to  the  relative  likelihood  of  a  failure  on  processor  k  compared  with 
no  failure  at  all.  Furthermore,  because  the  fault  tolerance  is  statistical  in  nature,  the 
probabilities  of  error  detection,  false  alarm,  and  misdiagnosis  may  be  computed  in 
terms  of  the  syndrome  variances  due  to  rounding  and  truncation  noise. 


2.3  Finite  precision  considerations 

Any  system  designed  to  use  the  fault-tolerant  algorithm  described  above  utilizing  cur¬ 
rent  signal  processing  technologies  would  benefit  from  employing  floating-point  arith¬ 
metic,  although  the  original  algorithm  considered  the  effects  of  fixed-point  arithmetic 
[9].  Floating-point  processing  carries  with  it  its  own  implications.  If  the  checksums 
are  computed  with  floating-point  arithmetic,  minimal  rounding  will  occur  in  com¬ 
parison  to  fixed-point  arithmetic,  and  the  energies  of  the  syndromes  will  typically  be 
many  times  smaller.  Therefore,  hardware  failures  become  easier  to  discover,  and  the 
rounding  noise  in  practice  does  not  cause  a  me«isurable  false  alarm  or  misdiagnosis 
rate. 

Errors  due  to  limited  precision  uncover  a  subtle  disadvantage  to  using  equation 
(2.17)  for  error  correction.  In  theory,  once  the  faulty  processor  h«is  been  identified, 
a  simple  linear  combination  of  the  syndromes  is  sufficient  to  estimate  the  fault;  sub¬ 
tracting  this  linear  combination  from  the  faulty  processor’s  output  yields  the  correct 
data.  In  practice  however,  a  fault  can  cause  the  floating-point  processor’s  output  to 
have  extremely  large  values,  near  the  extreme  limits  of  its  dynamic  range.  These  large 
values  are  then  carried  over  into  the  calculation  of  the  syndromes,  which  will  then  be 
very  large  as  well.  A  weighted  sum  of  these  large  valued  syndromes  may  then  cause 
an  overflow,  causing  trouble  in  the  fault  correction  step.  A  more  numerically  reliable 
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approach  to  fault  correction  is  to  estimate  the  correct  processor  output  directly  from 
the  observed  outputs  of  the  other  working  processors.  Combining  equations  (2.3)  and 
(2.17)  yields: 


j  /v+u  j  N+C  S 

^k,k  j=N+l  k,k  j=N+l  m=l 

j  N+C  N+C  j  N 

=  Kk-—  E  E  Kk^j.-^ym-  —  En.my„ 

'  k,k  j=N+l  m=N+l  '  k,k  m=l 


Vt 


1  N+C 

=  yjt  -  —  E  n.myr, 

*  fc.it  m=l 
1  N+C 

7T  ‘^k.m.y.rn 


'  k,k  m=l 

m^k 


(218) 


where  is  defined  in  (2.9).  This  formula  is  numerically  more  robust  than  (2.17), 
although  it  involves  about  {N  +  C)fC  times  more  computation. 


2.4  Hardware  prototype 

The  primary  advantage  of  analytic  fault-tolerance  is  that  the  computational  overhead 
is  generally  far  less  than  the  200%  overhead  required  by  traditional  triple  modular 
redundancy.  This  method,  however,  has  its  disadvantages  as  well.  Because  this 
technique  relies  upon  a  statistical  test  to  determine  which  processor  most  likely  failed, 
and  the  value  of  that  fault,  mistakes  can  occur  in  the  presence  of  unusually  high 
rounding  noise.  Faults  may  be  ignored,  may  be  declared  when  they  do  not  exist,  or 
may  be  misdiagnosed  as  another  processor’s  error.  Fortunately,  a  misdiagnosis  occurs 
for  the  most  part  when  the  error  is  small,  and  therefore  any  mistaken  “correction” 
will  most  likely  be  small  as  well.  An  additional  problem  is  that  a  failed  processor’s 
output  is  necessarily  corrected  from  the  syndromes,  which  are  themselves  corrupted 
with  round-off  noise.  Thus  the  reconstructed  output  contains  a  noisy  version  of  the 
missing  output. 

Additional  questions  arise  when  an  actual  implementation  is  considered.  Consid¬ 
eration  must  be  given  to  moving  data  in  and  out  of  the  N  +  C  processors,  as  well  as 
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performing  the  checksum  and  syndrome  calculations,  and  shipping  the  data  to  the 
MLE  processor  for  final  correction.  Substantial  control  and  processor  synchroniza¬ 
tion  is  required  throughout.  In  order  to  test  this  least  squares  algorithm,  a  hardware 
prototype  was  constructed  [2]  as  illustrated  in  figure  2.5,  using  commercially  available 
components. 


Figure  2.5:  MISF  algorithmic  fault-tolerant  testbed 

The  prototype  v/as  built  using  four  commercial  VE^32C  boards  from  Valley  Enter¬ 
prises,  each  housing  four  floating-point  DSP32C  digital  signal  processors  from  AT&T, 
for  a  total  of  sixteen  processors.  Each  processor  is  theoretically  capable  of  25MFlops, 
yielding  a  theoretical  total  of  400MFlops.  Figure  2.6  depicts  one  of  the  processor 
boards.  All  boards  have  VME  interfaces,  which  are  used  for  downloading  code  into 
the  processors,  for  parsing  synchronization  messages  between  the  host  and  the  pro¬ 
cessors,  and  for  other  board  control.  A  lOMHz,  16-bit  parallel  bus  provides  input  to 
all  four  boards,  feeding  data  into  an  input  FIFO  on  each  board  which  can  be  read  by 
any  processor.  A  separate  lOMHz  bus  can  be  driven  with  output  from  any  processor 
on  any  board. 

A  customized  I/O  board,  illustrated  in  figure  2.7,  configured  to  shunt  the  output 
bus  directly  to  the  input  bus,  allows  any  processor  to  send  data  to  up  to  four  processors 
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Figure  2.6:  Valley  VE-32C  processor  board  with  four  DSP32C’s,  VME  bus  interface, 
and  FIFO  I/O  bus. 


on  four  different  boards.  It  can  also  be  used  to  buffer  input  or  output  data  between 
the  host  and  the  fast  bus  systems  via  the  VME  bus.  A  Motorola  MVME147  VME 
board  with  a  MC68030  processor  acts  as  host  to  the  system,  and  is  connected  via 
Ethernet  to  a  Sun  3/260  development  system.  The  host  utilizes  the  VXWORKS 
operating  system,  which  interfaces  smoothly  with  the  SUN  UNIX  environment.  C 
code  can  be  compiled  for  either  the  host  or  the  DSP32C’s,  and  downloaded  using 
VXWORKS  and  some  custom  commands. 

Because  of  the  limitations  involved  with  shipping  the  data  between  processors, 
the  actual  layout  of  the  tasks  becomes  very  important.  A  generic  layout  showing  the 
placement  of  the  DSP32C’s  on  the  separate  VE-32C  boards  in  conjunction  with  the 
I/O  board  is  shown  in  figure  2.8.  As  it  will  be  seen,  the  actual  taisk  distribution  will, 
with  such  an  architecture,  strongly  influence  the  selection  of  the  weights. 


24 


Input  Bus 


VME  Bus 

(Connects  to  all  boards) 


Figure  2.8;  Layout  of  the  16  DSP32C’s  with  the  I/O  board. 
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Output  Bus 


Chapter  3 


Single  Input  Multi-Function 
(SIMF)  Fault  Tolerance 


3.1  The  structure 

Representing  a  fundamentally  different  signal  processing  architecture,  single  input 
multi-function  (SIMF)  fault-tolerance,  introduced  in  [1],  enables  a  wealth  of  applica¬ 
tions  previously  unattainable  with  MISF  fault-tolerance.  As  the  name  implies,  SIMF 
fault-tolerant  architectures  have  a  common  input  to  all  channels,  with  each  processor 
performing  a  different  linear  task  on  the  data.  Figure  3.1  illustrates  such  a  system. 

As  figure  3.1  shows,  SIMF  fault- tolerance  is  merely  a  transposition  of  a  MISF 
architecture.  The  checksums  now  compute  weighted  linear  combinations  of  the  linear 
functions,  as  opposed  to  the  data.  Letting  F/t  be  the  linear  operator  in  the  A:th 
processor  for  1  <  A:  <  iV  yields: 

=  iorj  =  N  +  l,...,N  +  C  (3.1) 

fc=i 

where  again,  the  wj^k  are  scalar  weights.  The  same  p  length  data  vector  z(m)  is  then 
sent  through  all  ^  -f  C  processors,  producing  r  length  output  vectors  j/^(m): 

|^(m)  =  Fifci(m)  for  k  =  I,. . .  yN  +  C  (3.2) 
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Figure  3.1:  SIMF  fault-tolerant  architecture. 

The  syndromes  then  remain: 

N 

Sj(m)  =  y  (m)  -  for  ;  =  +  1, . . . ,  +  C  (3.3) 

fc=i 

A  SIMF  configuration  enjoys  at  least  one  advantage  over  similar  MISF  architec¬ 
tures  when  it  comes  to  balanced  computations  through  the  checksum  channels.  MISF 
systems,  as  figure  2.2  illustrates,  must  first  compute  weighted  sums  of  the  input  pack¬ 
ets  before  processing  in  the  checksum  channels  can  begin.  A  SIMF  architecture,  by 
comparison,  is  capable  of  processing  data  through  the  checksum  channels  without 
delay.  It  may  turn  out,  however,  that  the  checksum  channel  operators  in  equation 
(3.1)  involve  significantly  more  computations  than  those  in  the  working  channels.  In 
this  case,  the  lack  of  delay  before  processing  in  the  checksum  channels  begins  may  be 
overshadowed  by  the  delay  in  the  processing  itself. 

This  is  not  the  c«ise,  however,  if  the  structure  of  the  operators  in  the  working 
channels  are  all  similar.  As  in  chapter  2,  if  the  input  and  output  vectors  have  length 
p  and  r  respectively,  then  the  linear  operator  F*  may  be  represented  as  an  r  x  p 
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matrix.  The  structure  of  the  individual  Ft’s,  not  surprisingly,  will  determine  the 
structure  of  the  checksum  channel  operators.  For  example,  assume  that  p  =  r  =  2, 
and  the  matrices  Ft  all  have  a  diagonal  structure  such  that; 


ioT  k  =  1, ...  ,N 


(3.4) 


where  ak,bk  are  arbitrary  scalar  variables.  In  this  case,  using  equation  (3.1),  the 
checksum  channels  become 

E£Li  0  ' 

0  llk=i^j.kh  J 

and  thus  after  precomputing  equation  (3.5)  for  all  C  checksums,  the  checksum  op¬ 
erators  will  require  the  same  number  of  multiplies  and  additions  when  applied  to 
the  data  as  the  working  channels,  and  the  processing  will  be  balanced.  When  the 
working  channel  operators,  Ft,  do  not  all  have  the  same  matrix  structure,  there  is 
no  guarantee  that  the  checksum  channel  operators,  Fj,  will  require  the  same  num¬ 
ber  of  multiplications  and  additions.  In  this  case  the  processing  is  not  balanced.  It 
should  be  noted  that  if  the  linear  operators  in  the  working  channels  are  time  varying, 
Fjt(<),  then  the  checksum  channel  operators  can  be  precomputed  only  if  a  closed-form, 
analytic  solution  exists  for  equation  (3.1). 


i  =  Ar-fl,...,iV-hC  (3.5) 


3.2  The  algorithm 

The  maximum  likelihood  estimation  and  correction  algorithm  of  section  2.2  computed 
cross-correlations  from  the  syndromes,  compared  the  sum  of  the  auto-correlations 
against  a  fixed  threshold  to  check  for  failure,  and  computed  relative  likelihoods  based 
upon  the  correlations  and  the  weight  vectors  of  equation  (2.12).  In  other  words,  the 
entire  algorithm  depended  only  upon  the  syndromes.  Since  the  syndromes  for  a  SIMF 
configuration  are  computed  exactly  as  they  are  for  a  MISF  configuration,  the  method 
remains  unchanged.  Therefore,  both  architectures  can  be  protected  by  the  same  fault 
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detection  and  correction  algorithm. 


3.3  Possible  applications 

3.3.1  Same  nature  data 

The  most  conceivable  applications  for  SIMF  fault-tolerance,  are  ones  in  which  the 
functions  differ  only  slightly,  and  the  nature  of  the  operation  the  working  channels 
are  implementing  is  the  same.  In  other  words,  the  matrix  structure  of  the  linear 
operators  are  the  same.  Two  examples  of  this  would  include  both  radar  and  sonar 
processing.  In  these  types  of  signal  processing  for  instance,  the  inputs  from  several 
sensors  are  delayed  in  a  predetermined  manner,  and  summed  to  determine  the  incident 
signal  strength  from  a  predetermined  direction.  The  working  channels  in  these  cases 
then  differ  only  by  the  delays  given  the  signals  of  the  various  input  sensors. 

3.3.2  Different  nature  data 

Not  all  of  the  data  broadcast  to  the  bank  of  processors  in  figure  3.1  need  be  used  by 
every  processor.  Each  processor  may  apply  its  own  window  to  select  the  portion  of 
the  input  composite  data  that  is  applicable  to  the  linear  operation  being  performed. 
Figure  3.2  illustrates  this  case.  The  configuration  depicted  in  this  figure  combines 
the  aspects  of  both  MISF  and  SIMF.  Not  all  processors  are  using  the  same  input 
packet,  although  the  same  composite  data  vector,  x,  is  sent  to  all  processors.  In 
this  case,  the  checksum  channels  would  receive  the  entire  composite  data  vector,  and 
the  various  windows  would  then  be  applied  in  the  weighted  sum  of  the  various  Ft’s. 
Furthermore,  the  matrix  structure  of  the  various  F^’s  may  or  may  not  be  the  same. 
If  the  matrix  structure  of  the  various  Ft’s  differs  from  working  channel  to  working 
channel,  it  is  quite  likely  that  the  checksum  channel  operators  will  require  additional 
multiplications  and  additions,  and  the  computation  will  not  be  balanced  throughout 
the  processors. 

A  potential  application  for  the  system  of  figure  3.2  can  again  be  taken  from  radar 
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Figure  3.2:  SIMF  fault-tolerant  architecture  for  composite  data. 


and  sonar  processing.  For  very  large  arrays,  sections  of  sensors  are  typically  grouped 
together  into  smaller  sub-arrays  in  order  to  ease  the  processing  burden,  where  Xf. 
represents  data  from  the  fcth  sub-array.  The  outputs  of  various  processors  can  then 
be  combined  to  reconstruct  the  full  array.  It  is  also  possible,  assuming  the  data 
bandwidths  are  the  same,  to  perform  radar  processing  in  one  or  more  processors,  sonar 
processing  in  others,  and  protect  them  all  with  the  same  set  of  checksum  processors. 
In  a  like  manner,  systems  that  were  previously  separate  can  be  joined  to  reduce  the 
amount  of  redundancy  required  to  make  them  fault-tolerant. 
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Chapter  4 


Beamformer  Application 


4.1  Multidirectional  fault-tolerant  beamforming 

Single  input,  multi-function  fault-tolerance  is  particularly  well  suited  for  multidirec¬ 
tional  sonar  beamforming,  where  the  linear  operator  has  the  same  structure  in  all 
channels,  and  the  data  need  not  be  windowed  prior  to  processing.  Beamformers  are 
essentially  spatial  filters,  getting  their  output  by  taking  the  weighted  sum  of  signals 
from  an  array  of  sensors.  Rather  than  consider  a  channel  to  consist  of  a  single  sensor, 
a  channel  will  instead  consist  of  the  entire  array  of  sensors.  In  this  way,  different 
channels  then  implement  different  time  delays  between  the  sensors,  thereby  steering 
the  beam  in  a  different  direction.  Figure  4.1  illustrates  this  point. 

Most  digital  beamformers  already  work  in  this  manner;  rather  than  “paint”  the 
area  of  interest,  the  beams  are  computed  in  parallel  to  give  an  instantaneous  beam 
pattern.  Ordinary  digital  beamformers,  however,  require  very  high  sampling  rates 
and  thus  place  severe  demands  upon  the  hardware,  and  do  not  have  the  benefits  of 
arithmetic  redundancy . 
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Multidirectional  Beamformer 

Figure  4.1:  Multidirectional  beamformer  with  N  look  angles. 

4.2  Digital  interpolation  beamforming 

Conventional  analog  time-domain  beamforming  may  be  represented  as: 

Ne 

Hi)  =  OnXnit  -  T„)  (4.1) 

n=l 

where  n  is  the  sensor  number,  Ne  is  the  number  of  sensors  in  the  array,  a„  is  the  nth 
shading  coefficient  of  the  spatial  window  applied  to  the  array,  and  t„  represents  the 
time  delay  required  for  the  nth  sensor  in  the  array  in  order  to  form  the  beam,  b{t). 
Figure  4.2  illustrates  this  conventional  beamformer.  In  this  particular  application,  no 
spatial  window  was  applied,  and  so  a„  =  1  always. 

Digital  time-domain  beamforming  involves  sampling  equation  (4.1)  but  places 
considerable  demands  upon  system  components  by  requiring  sampling  periods  several 
times  smaller  than  those  required  for  mere  waveform  reconstruction.  This  is  so  that 
the  time  delays  needed  to  form  the  beam,  typically  much  smaller  than  the  Nyquist 
sampling  period,  can  be  implemented.  Digital  interpolation  beamforming  relaxes  this 
requirement  by  allowing  the  data  to  be  sampled  at  the  Nyquist  period,  A,  and  then 
interpolating  the  data  prior  to  beamformation.  The  data  is  thus  upsampled  by  a 
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Sensor  Array  Beamformer 

Figure  4.2:  Analog  beamformer  and  sensor  array  of  Ne  hydrophones 


factor  of  L  to  a  new  sampling  period,  6,  such  that  6  =  A/L.  The  time  delays  needed 
for  beamformation  are  then  approximated  by  integer  delays,  Af„,  of  the  upsampled 
period,  6,  for  each  sensor  in  the  array.  The  accuracy  of  this  approximation  depends 
upon  how  accurately  M„6  approximates  t„,  where  A/„  is  the  rounded  integer  part  of 
the  quotient  A  beam  for  which 

T„  =  MnS,  n  =  1,..,  Ae  (4.2) 

is  called  synchronous  [12].  Due  to  the  linear  nature  of  this  equation,  it  is  expected 
that  a  synchronous  beam  would  only  be  possible  with  a  linear  array,  and  indeed  this 
is  the  case.  Any  bandp2iss  signal  with  bandwidth  W  centered  on  /„  and  limited  to 
the  frequency  interval 

{fo-W/2)<\w\<{fo  +  W/2)  for  W<f,  (4.3) 

may  be  represented  as 

x{t)  =  xi{t)cosuot  —  XQ{t)smuot  (4-4) 

where  x/(t)  and  i(}(t)  are  the  in-phase  and  quadrature  components  of  x,  respectively, 
band  limited  to  the  frequency  interval  |/|  <  VF/2,  and  Uo  is  the  angular  center 
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frequency  of  the  bandpass  signal  [5],  As  in  the  case  of  the  signal  waveform,  the  beam 
output  may  be  expressed  as 

b{t)  =  bi{t)cosu}ot  ~  bQ{t)smu}ot  (4.5) 

where  6/(t)  and  bQ{t)  are  the  in-phase  and  quadrature  components  of  b,  respectively. 
Combining  equations  (4.1)  and  (4.4)  yields: 

Ne 

b{t)  =  £  [xin{t  -  Tn)  cosa;o(t  -  t„)  -  XQ^{t  -  t„)  sinu;o(t  -  t„)]  (4.6) 

n=l 

where  xi„{t)  and  XQ„{t)  are  the  in-phase  and  quadrature  components,  respectively  of 
x„(/).  Equating  (4.5)  and  (4.6)  yields  the  following: 

^/(O  =  2  b/n(^  -  Tn)  COSijJoTn  +  XQ„(t  -  T„)  sinU?oT„]  (4.7) 

n=l 

Ne 

bqit)  =  ~  T„)sina;oT„  -  XQ„{t  -  T„)cosa;or„]  (4.8) 

n=l 

Hence,  the  in-phase  and  quadrature  components  of  the  beam  output  can  be  obtained 
from  the  in-phase  and  quadrature  components  of  the  sensor  outputs. 

4.2.1  Second-order  sampling 

Sampling  equations  (4.7)  and  (4.8)  requires  sampled  versions  of  xj{t)  and  XQ(t)  for 
each  sensor.  Rather  than  use  a  complex  demodulation  scheme,  these  samples  may 
be  obtained  directly  from  the  bandpass  signal  without  need  for  analog  multipliers 
or  filters  via  second-order  sampling.  With  this  method,  the  quadrature  components 
are  obtained  from  the  complex  envelope  in  pairs,  with  each  sample  period  yielding 
two  samples  separated  by  one  quarter  of  the  carrier  wavelength.  In  this  manner, 
one  sample  corresponds  to  the  in-phase  component,  and  the  other  corresponds  to  the 
quadrature  component.  Which  component  is  sampled  first  depends  upon  the  way  in 
which  the  second-order  sampling  is  implemented. 
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Figure  4.3  illustrates  the  second-order  sampling  method  for  obtaining  quadrature 
components. 

This  sampling  method  requires  a  sampling  period  of  A  where 

A  =  ^<VF-*  for  £=1,2,...  (4.9) 

In  Figure  4.3a,  the  signal  is  delayed  in  the  lower  path  and  both  samples  are  taken 
simultaneously,  yielding  [12] 

xi(mA)  =  x(mA) 

=  x/(mA)  cos(a;omA)  —  XQ{mA)  sin(u;omA) 

=  (-l)"‘'x/(mA)  (4.10) 


and 


X2(mA) 


:r(mA  - 

x/(mA  -  -^)  cos[a;<,(mA  -  ~)]  -  xqimA  -  -^)  sin[u>o(m A  -  -^)] 

^Jo  ^Jo  ^Jo  ^Jo 

(-ir'ig(mA  - -L)  (4.11) 

^Jo 


Here  an  implicit  approximation  is  made  by  assuming  XQ(mA  —  ^)  =  XQ(mA).  This 
approximation  is  valid  for  highly  narrowband  signals  which  can  be  expected  with  the 


x(t) 


V 


•X](mA) 


Time  Delay  'Jk _ ^ 

ll4fo  "  f — 
- '/,  =  A^  =  2/^/ 


x(t) 


•-<  w  Delay  = 

"-f  ,4. 


/,  =  =  2fall 


^xi\mA) 

^X2(mA) 


(a) 


(b) 


Figure  4.3:  Two  methods  of  performing  second-order  sampling,  (a)  by  delaying  the 
signal,  and  (b)  by  delaying  the  sampling  gate 
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sonar  system  considered  here  [10],  since  this  would  mean  a  complex  envelope  that 
varies  slowly  relative  to  the  modulating  frequency.  This  application,  however,  uses 
samples  obtained  as  in  Figure  4.3b,  where  instead  of  delaying  the  signal,  the  lower 
sampling  gate  is  delayed.  This  effectively  interchanges  the  order  of  the  components 
so  that  x[{mA)  =  (— l)’”^a:<}(mA)  and  X2(mA)  =  (— l)"*^x/(mA),  again  under  the 
same  narrowband  approximation.  Sampling  equations  (4.7)  and  (4.8)  with  period  A 
therefore  yields  : 

6/(mA)  =  [i/„(mA  -  t„)  coswoT„  +  XQ„(mA  -  t„)  sinwoTnj  (4.12) 

n=l 

Ne 

bQ{mA)  =  ^  [x/„(mA  -  T„)sina;or„  -  XQ„(mA  -  r„)cosa;oT„]  (4.13) 

n=l 

Henceforth  it  will  be  assumed  that  x/(mA)  and  XQ(mA)  are  available. 

4.2.2  Interpolation 

If  A  represented  a  sampling  rate  high  enough  for  digital  beamformation,  equations 
(4.12)  and  (4.13)  would  suffice.  That  is  not  the  case,  however,  and  the  samples 
must  be  interpolated  to  a  higher  sampling  rate  so  that  the  proper  delays  may  be 
implemented  by  shifting  the  signals  forwards  and  backwards  by  discrete  sample  bins. 
Because  they  have  a  limited  number  of  states,  and  because  they  have  linear  phase, 
FIR  filters  are  chosen  for  this  task.  Their  absence  of  recursion  allows  for  a  reduction 
in  the  required  computation,  by  making  it  possible  to  only  compute  the  output  points 
necessary.  If  the  filter  was  HR,  this  would  not  be  possible,  due  to  the  feedback  of  the 
outputs. 

Since  no  FIR  filter  can  ever  have  ideal  low-p«iss  filter  characteristics,  error  will 
be  introduced  at  the  beamformer  output.  This  finite  error  affects  the  time-filtered 
outputs  of  each  sensor,  which  in  turn  affects  the  space-filtered  beamformer  output, 
and  is  induced  because  the  frequency  response  of  the  filter  deviates  from  the  desired 
“box-car”  shape.  With  proper  filter  design,  this  error  can  be  minimized,  but  in 
general  the  beam  response  will  still  be  less  than  ideal  because  the  number  of  sensors 
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is  typically  limited. 

The  actual  sonar  chosen  for  this  application  was  the  WQS-1  obstacle  avoidance 
sonar  with  a  cylindrical  aperture.  The  necessary  specifications  for  the  WQS-1  are 
listed  in  table  4.1.  Other  sonar  implementations  have  utilized  interpolation  ratios  of 


Table  4.1:  Parameters  for  WQS-1  implementation 


Specification 

Variable 

Value 

frequency  of  modulation 

fo 

200  kHz 

radius  of  array 

WRAD 

6.875  in. 

angular  spacing  of  sensors 

ASPC 

3° 

assumed  signal  bandwidth 

W 

20  kHz 

complex  sampling  period 

A 

50  fis 

number  of  sensors  used 

Ne 

15 

interpolation  ratio 

L 

7 

4  and  10  [10,  12],  and  so  an  intermediate  value  was  chosen  for  this  application.  With 
the  use  of  the  Remez  exchange  algorithm  [7],  an  optimal  equiripple  FIR  filter,  h{m6) 
was  designed  with  the  following  characteristics: 


140  S 


15  1 

140  '  S 


where  /„  and  fi  are  the  upper  paissband  edge  and  lower  stopband  edge,  respectively. 
Equal  weight  was  given  to  both  the  passband  and  the  stopband  errors.  The  ripple  in 
the  passband  and  the  stopband  can  be  minimized  by  adding  more  coefficients  to  the 
filter,  at  the  cost  of  increased  computational  complexity.  The  number  of  coefficients, 
Nc,  used  for  this  application  was  63.  The  reason  behind  choosing  Nc  as  a.  multiple 
of  the  interpolation  ratio  will  become  apparent  in  the  next  section.  The  impulse  and 
magnitude  responses  are  shown  in  figures  (4.4)  and  (4.5)  respectively. 

The  first  step  to  take  in  interpolating  any  sequence  is  to  zero-pad  it.  Given  an 
interpolation  ratio  of  L,  the  zero-padded  data  sequence  may  be  represented  by  : 


i;/„(mA) 


m  =  0,±L,±2L,... 
0  otherwise 


(4.14) 
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where,  again,  A  is  the  sampled  period,  and  6  =  A/L  is  the  interpolated  period.  An 
analogous  construction  may  be  made  for  t;Q„(mA)  as  well.  Interpolating  the  data 
sequences  uses  a  discrete  convolution  so  that 

Nc-l 

xi„{m8)  =  h{k8)vi„[{m- k)6]  for  n  =  1, . . . ,  iV^  (4.15) 

fc=0 

where  again,  Xvn{m8)  is  formed  in  an  analogous  manner.  Given  that  the  time  delays, 
T„,  required  for  beamformation  are  approximated  by  integer  multiples  of  the  upsam¬ 
pled  data  period,  equations  (4.12)  and  (4.13)  can  be  approximated  appropriately  by 
shifting  the  interpolated  sequences: 

Nc 

6/(mA)  =  {i/„[(mZ/  -  M„)<5]  cosw^Tn  -1-  XQ„[{mL  -  A/„)(5]  sinu;oT„}  (4.16) 

n=l 

bqimA)  =  {xi„[{mL  -  A/„)^]  sinwoT„  -  XQ„[{mL  -  M„)(5]  cos WoT„}  (4.17) 

nssl 

and  by  combining  equations  (4.16)  and  (4.17)  with  (4.15),  the  following  equations 
result: 

N.  TNc-l 

b/(mA)  =  X/  ^  h{k8)vi^(^m A  —  M„8  —  k8)  cos u>oT„+ 

n=l  k=0 

Nc-l 

^  h{k8)vQ„{mA  —  M„6  —  A:6)sinu;oT„  (4.18) 

k=0 


bQ{mA)  =  Xrf  Xrf  b.{k8)v j^{m A  —  Mn8  —  k8)  sin (VoTn— 

n=l  k=0 
Nc-l 

h{k8)vQ„{m A  —  Mn8  —  k8)  cos  UoTn  (4-19) 

ifc=0 

where  bj{mA)  and  bglmA)  are  the  approximated  beam  sums. 

To  simplify  these  results,  it  is  possible  to  write  the  interpolated  data  values  as  a 
single  complex  point  t;„(mA),  such  that  v„(mA)  =  u/„(mA)  —  JVQ„(mA)  where  here 
u„(mA)  is  baseband.  The  conjugation  here  is  necessary  to  ensure  the  proper  signs  in 
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the  output  equations.  Using  Euler’s  equation  leads  to 

=  [v/„(mA)  cos  a;oT„  +  t;Q„(m  A)  sin  UoTn] 

sincJoTn  -  Vg„(mA)  COSWoTn]  (4.20) 

and  with  a  similar  baseband  complex  representation  for  the  in-phase  and  quadrature 
components  of  the  beam,  6(mA)  =  6/(mA)  +  jbQ{mA),  so  that  equations  (4.18)  and 
(4.19)  can  be  reduced  to 

6(mA)  =  53  h{kS)e^<^o^’'v4mA  -  M^S  -  kS)  (4.21) 

n=l  it=0 

4.3  Implementing  the  beamformer 

Using  the  hardware  described  in  section  2.4,  a  multi-directional  digital  interpolation 
beamformer  was  chosen  to  demonstrate  the  principles  of  SIMF  fault-tolerance.  The 
particular  layout  of  the  beams  is  as  illustrated  in  Figure  4.6,  where  there  are  N  = 
10  working  channels,  and  C  =  3  checksum  channels.  The  weights  chosen  for  the 
application  were  the  same  as  in  equation  (2.12). 

4.3.1  Calculating  the  time  delays 

Using  the  specifications  of  table  4.1,  it  is  possible  to  determine  the  time  delays  and 
their  integer  approximations  based  upon  the  geometry  of  the  array.  Figure  4.7  helps 
illustrate  how  these  values  are  determined.  It  is  reasonable  to  assume  that  an  incident 
waveform  may  be  approximated  as  a  plane.  The  steering  direction,  0,  is  then  normal 
to  the  incident  waveform  at  the  point  of  tangency.  To  steer  the  beam  to  this  direction 
then,  requires  a  series  of  time  delays,  t„,  for  each  sensor.  These  time  delays  can  be 
found  by  projecting  the  individual  sensors  from  a  mirror  image  of  the  array  onto 
the  incident  waveform.  Hence,  the  t„’s  are  determined  from  the  cosine  of  the  angle 
between  the  normal  to  the  incident  waveform,  and  the  sensor  of  interest.  With  this 
approach,  the  time  delays  would  be  zero  at  the  geometric  limits  of  the  array,  and 
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VME  Bus 

(Connects  to  all  boards) 


Figure  4.6:  Task  layout  for  N  =  10,C  =  3 
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Figure  4.7:  Determination  of  time  delays  for  a  signal  off  broadside 


would  be  maximized  at  the  point  of  tangency.  For  this  application,  however,  these 
values  of  t„  are  negated,  so  that  the  sensor  closest  in  angular  distance  to  the  steered 
direction  will  have  the  most  negative  value.  This  is  done  so  that  the  values  of  Mn 
may  be  subtracted  from  the  least  negative  integer  approximation  used  in  the  array; 
therefore  A/„  =  0  corresponds  to  By  doing  this,  the  values  of  A/„  will  be 

strictly  nonnegative,  and  will  be  referenced  to  the  limits  of  those  sensors  in  the  array 
used  in  the  calculation  of  the  beam  (±v?)-  By  comparison,  the  r„’s  are  referenced  to 
the  geometrical  limitations  of  the  array  {$  ±  90®  for  a  cylindrical  array  such  as  this). 

4.3.2  Polyphase  filter  implementation 

With  limited  memory  on  each  of  the  VE-32C  boards  (8K  4-byte  words  per  DSP, 
or  32K  bytes)  zero-padding  the  data  is  not  a  viable  option.  Fortunately,  this  is 
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not  necessary.  Equation  (4.21)  indicates  that  the  output  sequence  has  the  same 
sampling  period  as  the  original  input  sequences.  Figure  4.8  gives  a  block  diagram 
representation  of  equation  (4.21).  The  original  complex  signals,  sampled  with  period 
A,  are  upsampled  by  a  factor  of  L,  then  filtered  by  a  complex  FIR  filter.  This  is 
the  interpolation  and  sinusoid  weighting  step  combined.  Each  sensor’s  signal  is  then 
uniquely  delayed  by  an  integer  number  of  samples  at  the  upsampled  data  period. 
After  the  signals  have  been  delayed,  they  are  then  downsampled  to  the  previous 
period  A,  and  summed  to  form  the  beam.  The  interpolation  and  decimation  ratios 
need  not  be  identical,  but  since  they  are,  they  may  be  removed  altogether  by  using 
a  collection  of  polyphase  filters.  Given  a  full  order,  linear  phase  FIR  filter  h  of  order 
Ac  —  1,  a  series  of  smaller  order  polyphase  filters  gp(mA)  can  be  constructed  [3]  such 
that 

g(3{mA)  =  h{mSL  +  y3),  for  =  0, 1, . . . ,  Z,  —  1  and  V  m  (4.22) 

Figure  4.9  shows  the  implementation  of  a  polyphaise  filter  bank.  Here,  the  output  of 
the  /3th  path,  y^(mA),  corresponds  to  the  interpolation  output  sample  y{mSL  +  /3) 
obtained  by  applying  a  single  input  x(mA)  to  the  filter  bank.  In  other  words,  for  a 
particular  x(mA),  each  of  the  L  branches  of  the  filter  bank  contributes  one  nonzero 
output  (or  vector  of  outputs)  corresponding  to  its  family  of  output  samples.  Since 
the  integer  delays,  M„,  remain  constant  for  a  given  sensor  and  a  given  steering  angle, 
only  one  branch  of  the  polyphase  filter  bank  per  hydrophone  is  required  for  a  desired 
steering  direction.  This  makes  it  possible  to  not  only  remove  the  padded  zeros  in 
the  data  sequence,  but  to  implement  the  convolution  with  a  filter  1  /L  times  as  long, 


B(mA) 


Figure  4.8:  Digital  interpolation  beamforming  for  complex  input  sequences. 
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thereby  reducing  total  computation  per  sensor  by  an  order  of  for  each  steering 
angle. 

To  obtain  an  expression  for  in  terms  of  M„,  assume  for  the  moment  that 


Mn  <  L.  Figure  4.10  illustrates  this  case.  The  choice  of  Mn  is  based  upon  the 
number  of  steps  back  in  time  needed  for  a  particular  sensor  to  properly  delay  its 
interpolated  data  and  form  the  beam.  Conversely,  the  outputs  of  the  polyphase  filter 
bank  steps  forward  in  time  with  each  sample  sent  through  the  network.  Thus,  for 
the  case  where  Mn  <  L,  it  must  be  that  M„  +  /3n  =  L.  The  case  where  Mn  >  L 
corresponds  to  a  combined  whole  integer  delay  P„,  plus  the  fractional  delay  Pn  • 
Accordingly,  a  modulo  L  division  must  first  be  performed  on  Mn-  This  yields  the 


following  equations: 


(4.23) 


Figure  4.10:  Determination  of  fractional  delay 
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Table  4.2:  Subfilters  used  for  TV  =  10,  C  =  3  beamformer  application 


Values  of  for  all  N  working  processors 

n 

0  in  degrees 

-22.5 

-17.5 

-12.5 

-2.5 

4-2.5 

-h7.5 

-1-12.5 

-fl7.5 

-1-22.5 

1 

4 

4 

5 

0 

0 

0 

0 

0 

2 

4 

4 

5 

0 

0 

0 

6 

0 

0 

3 

4 

4 

5 

6 

0 

6 

6 

6 

6 

4 

4 

4 

5 

6 

6 

6 

5 

6 

6 

5 

4 

4 

5 

6 

6 

6 

5 

6 

5 

6 

4 

4 

5 

6 

6 

5 

5 

5 

5 

7 

4 

4 

4 

5 

6 

6 

5 

5 

5 

5 

8 
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I3n  =  [(Pn  +  1)  •  L  -  M„]  mod  L  (4.24) 

The  values  of  Pn  used  in  this  application  are  given  in  table  4.2. 

Using  this  fact,  the  inner  summation  term  of  equation  (4.21)  may  be  rewritten  so 
that 

Nc-i  1 

XI  h{k6)vn{TnA  -  M„6  -  kS)  =  g0„{kA)xr^{mA  -  kA)  (4.25) 

k=0  k=0 

where,  as  with  v„{m6),  x„{mA)  =  x/„(mA)  —  jxQ„{mA),  and  q  is  the  order  of  the 
/9„th  subfilter.  By  having  selected  A^c  as  a  multiple  of  the  interpolation  ratio,  q  is 
constant  for  all  I  <  n  <  N^. 

Now  applying  this  result  to  equation  (4.21),  the  same  results  are  achieved  without 
zero-padding  the  data. 

6(mA)  =  £  2  -  t)A)  (4.26) 

n=l *=0 

Thus  each  output  point  is  the  sum  over  all  sensors  in  the  array  of  the  resulting  discrete 
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time  convolution  between  each  sensor’s  complex  data  and  a  complex  FIR  filter.  Since 
all  values  are  now  .  ;enced  to  the  same  sampling  period  A,  it  will  be  omitted  for 
simplicity. 


4.3.3  Forming  the  checksum  responses 

An  FIR  filter  can  be  implemented  in  several  ways,  but  to  aid  in  the  representation 
of  input /output  relationships,  it  is  perhaps  easiest  to  view  an  FIR  filter  as  a  matrix 
multiplication.  To  do  this,  it  is  first  necessary  to  define  the  data  on  which  the  filter 
is  to  be  applied. 

Figure  4.11  shows  the  layout  for  a  data  bin  of  a  single  sensor  for  the  first  batch  of 
data.  Here  TV,  is  the  number  of  complex  data  samples  per  sensor  sent  in  a  given  batch 


Output  region 


TV, 


O  O  O  O  O  O  0(0 


1 


^  jo  o  o 


o  o  o  o  o 


Initial  Data  Batch 


Subsequent 

Batches 


Figure  4.11:  Data  bin  structure  with  q  +  b  point  overlap  region  and  encircled  filter 
window. 


of  data,  q  is  the  order  of  the  subfilters  as  before,  Pn,k  is  the  number  of  whole  integer 
delays  for  sensor  n  in  working  channel  fc,  and  b  is  the  maximum  number  of  whole 
integer  delays  required  across  the  sensors  to  form  a  particular  beam.  It  is  important 
to  note  that  since  this  application  involves  TV  =  10  different  beams,  b  must  be  chosen 
as  the  largest  of  the  maximum  delays  required  in  each  working  channel  so  that  the 
checksum  channel  responses  may  be  properly  formed.  In  this  manner,  the  size  of 
the  data  bin  remains  constant  for  all  TV  +  C  processors,  assuming  a  fixed  set  of  look 
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angles. 

Even  though  only  N,  points  get  sent  with  each  batch  of  data,  the  additional  pad 
points  in  the  data  bin,  where  pad  =  q  +  b,  are  necessary  to  maintain  a  continuous 
output.  In  the  absence  of  any  whole  integer  delays,  the  last  q  points  would  be  cycled  to 
maintain  a  continuous  output  through  the  use  of  overlap-save  convolution  [9].  When 
whole  integer  delays  are  introduced,  the  FIR  filter  is  implemented  over  a  delayed  set 
of  data  points,  initiated  on  the  first  batch  of  data.  If  a  particular  sensor  had  no 
whole  integer  delays,  and  only  the  last  q  points  were  cycled,  there  would  be  precisely 
6  points  of  convolution  overlap  unaccounted  for  in  the  next  filtering  operation  for 
that  sensor,  since  a  given  sensor  applied  to  a  given  beam  will  always  use  the  same 
subfilter.  Including  b  in  the  length  of  the  data  bin  allows  for  the  necessary  delay  in 
the  filtering  operation.  Including  b  in  the  number  of  data  points  needed  to  be  cycled 
after  each  convolution  guarantees  a  continuous  output  from  each  sensor. 

As  Figure  4.11  illustrates,  there  is  no  delay  compensation  necessary  after  the  first 
batch  of  data,  and  hence  the  coefficients  required  in  the  checksum  channels  after  the 
first  data  batch  are  merely  a  weighted  sum  of  the  filter  coefficients  in  the  working 
channels  on  a  per  sensor  basis  .  The  necessary  delay  comes  in  the  first  data  batch, 
and  must  be  accounted  for  to  properly  form  the  checksum  channel  beams.  The  data 
in  this  maxlen  point  data  bin,  where  maxlen  =  A^,  -f  pad,  can  be  represented  by 
means  of  the  vector  z„(m). 


^n(”^)  =  [xn{mNj  -  pad) . . .  z„(mA^,  +  (A^,  -  1))]^  (4.27) 

where  m  is  the  batch  number,  and  x„(t)  =  0  for  t  <  0.  Correspondingly,  a  similar 
output  vector  from  a  single  sensor  and  a  single  working  channel,  can  be 

defined: 

L.fc(»n)  =  -  pad) . . .  -f  (A,  -  1))]^  (4.28) 

where  output  values  preceding  m A,  —  6,  and  following  m A,  -f-  (A,  —  6  —  1),  are  invalid 
due  to  the  overlap-save  convolution  being  performed.  Thus,  each  z„(m)  generates 
valid  output  samples  6n,fc(^A^»  ~  b)  through  bn,k{ixiN,  -f-  (A,  —  6—1))  contained  within 
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6„^fc(m).  Written  as  a  discrete  time  convolution,  the  input /out  put  relationship  is  as 
follows: 

*n.fc(0  =  Yi  -  *)  ^  ~  +  (^*  ”  1)  (4-29) 

Jb=0 

where  the  phase  terms  have  been  temporarily  removed  for  convenience.  This  repre¬ 
sentation,  however,  fails  to  account  for  the  shift  applied  to  the  first  data  batch  for 
sensors  with  nonzero  whole  integer  delays.  This  delay  can  be  accounted  for  with  a 
matrix  multiplication  representation.  Define 

0  •••  0 

:  9Pn.M  0 

<//?„,*  (?)  :  : 

0  0 

0  •••  0  9P„A9)  •••  90n.k{^) 

where  G„,fc  is  a  {maxlen  —  Pn,k)  x  {maxlen  —  Pn,fc)  band  diagonal  matrix  operating 
on  the  desired  window  of  data  from  the  nth  sensor  on  the  fcth  processor  with  Pn,k 
defined  in  equation  (4.23).  Let  be  a  Pn,k  x  Pn,k  matrix  of  all  zeros.  Then  the 
input-output  for  the  nth  sensor  on  the  fcth  channel  can  be  written  as 

=  I  I  (4.31) 

SO  that  =  Fn,*(m)  •  x„(m).  It  now  becomes  a  simple  matter  to  find  the  total 

beam  response  of  a  given  channel,  6^(m),  rather  than  just  one  sensor’s  contribution.  A 
block  matrix,  F)t(m),  may  be  formed  so  that  F*(m)  =  [Fi,*(m), . . . ,  F;v„*(m)],  and 
a  block  data  vector,  x(m)  may  be  formed  as  x(m)  =  [xf(m),...,  Then 

equation  (4.26)  with  whole  integer  delays  properly  accounted  for  becomes 

bk{m)  =  Ffc(m)  •  x(m)  (4.32) 
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As  figure  4.11  illustrates,  Pn,k  niay  be  taken  as  zero  after  the  first  batch  of  data 
without  problem,  since  the  delay  has  by  that  time  already  been  implemented.  This 
means  that  F*(m)  is  constant  after  the  first  batch  of  data,  and  thus  the  checksum 
channel  operators  may  be  pre-calculated,  allowing  the  checksum  channel  calculations 
on-line  to  match  those  of  the  working  channels.  The  structure  of  the  F„,fc(0)  matrices 
becomes  very  valuable  in  determining  the  proper  filter  coefficients  required  by  the  C 
checksum  channels.  Figure  4.12  shows  a  weighted  sum  of  the  F„,fc(0)’s,  for  a  given 
n,  over  all  N  working  channels.  Again,  the  first  pad  rows(columns)  are  zero  for  this 


Figure  4.12:  Structure  of  checksum  operators  for  the  first  batch  of  data  and  a  single 
sensor. 

batch  since  the  signal  is  assumed  to  be  causal.  Note  that  the  only  shift  needed  is  at 
the  beginning  since  the  last  b  points  of  h„  ^{m)  are  not  output.  Here,  the  separate 
blocks  represent  the  various  individual  matrices  *  that  operate  on  specific 

windows  of  data  for  a  given  steering  angle. 

Note  that  the  sum  of  the  separate  F„,it(0)’s  is  band  diagonal  as  well.  The  lightly 
shaded  region  corresponds  to  the  nth  sensor  in  one  or  more  of  the  N  working  channels 
having  anything  but  a  maximal  number  of  whole  integer  delays.  This  region  has,  at 
most,  b  nonzero  columns  for  a  particular  sensor  in  the  checksum  channels.  The  heavily 
shaded  area  is  the  region  within  which  all  channels  (for  a  given  sensor)  contribute  to 
the  checksum  output.  Hence,  for  the  first  batch  of  data,  it  is  necessary  to  store  6-1-1 
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sets  of  complex  filter  coefficients,  per  sensor,  in  each  of  the  C  checksum  channels.  For 
subsequent  batches,  only  the  last  one  of  these  6+1  sets  will  be  used, 

4.4  Results 

The  beamformer  was  tested  with  forward-looking  sonar  data  collected  at  Lake  Cham¬ 
plain  [4].  Ten  ribbed  steel  drums  of  length  72  in.  and  diameter  21  in.  were  used  as 
test  targets.  The  acquired  data  consisted  of  16  synchronous  receive  staves  on  the 
AN/WQS-1  active  sonar,  of  which  15  were  used. 

Figure  4.13  shows  the  return  from  a  single  ping,  where  the  look  angles  of  the 
JV  =  10  processors  have  been  chosen  so  as  to  center  the  beams  on  the  target  return. 
This  image  shows  the  actual  return  from  the  10  processors  when  they  are  all  properly 
working.  Since  each  processor  computes  only  one  discrete  look  angle,  and  the  pro¬ 
cessor’s  are  all  separated  by  5°,  it  was  necessary  to  spatially  interpolate  the  outputs 
in  order  to  get  a  smooth  image.  As  figure  4.13  shows,  the  largest  returns  come  from 
+2.5®,  corresponding  to  processor  6,  The  target,  in  fact,  was  located  at  approximately 
3®  off  broadside.  As  is  shown,  the  largest  return  occurs  at  approximately  26.2  ms. 
The  smaller  returns  prior  to  this  resulted  from  some  assorted  cinder  blocks  that  were 
in  the  water  near  the  deployed  target. 

For  the  purposes  of  testing  the  maximum  likelihood  algorithm  of  section  2.2,  a 
processor  was  chosen  to  appear  as  though  it  had  failed  by  injecting  an  error  at  its 
output.  Three  types  of  constant  errors  were  allowed,  representing  separate  types  of 
failures.  The  options  consisted  of  adding  a  constant  to  the  data,  setting  the  data  to  a 
constant  value,  or  adding  a  constant  to  a  single  sample.  For  the  purposes  of  detecting 
the  faulty  processor,  the  latter  of  these  error  types  was  needed.  The  threshold,  7,  was 
set  so  that  a  single  bit  transient  error  on  a  single  processor  would  not  go  undetected. 
Once  detected,  the  error  was  corrected  using  the  numerically  robust  equation  (2.18). 
Once  the  threshold  is  set  to  detect  a  single  bit  error,  detecting  more  severe  errors 
becomes  easier.  Table  4.3  shows  the  relative  likelihoods  of  all  processors  when  various 
types  of  errors  are  injected  into  the  first  working  processor. 
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Figure  4.13:  Beamformer  response  when  all  processors  working 


Table  4.3:  Measured  relative  likelihoods  with  threshold  for  N  =  10,  C  =  3 
_ (boldface  indicates  largest  likelihood) _ 


Task 

Likelihoods 

No  fault 

1-bit  fault, 
sample  3, 

01 

1-bit  fault, 
all  samples. 

Set  real=0, 
imaginary=0. 

Threshold 

4.085e-04 

4.085e-04 

4.085e-04 

4.085e-04 

0i 

3,113e-12 

4.578e-05 

1.56e-03 

2.51e+00 

02 

1.98e-ll 

5.08e-06 

1.73e-04 

2.79e-01 

03 

3.47e-12 

5.09e-06 

1.73e-04 

2.79e-01 

04 

3.84e-ll 

5.09e-06 

1.73e-04 

2.79e-01 

0s 

1.90e-ll 

3.05e-05 

1.04e-03 

1.67e-}-00 

06 

5.34e-12 

8.19e-12 

2.04e-10 

5.96e-08 

07 

2.68e-12 

3.05e-05 

1.04e-03 

1.67e-|-00 

06 

4.14e-ll 

4.09e-ll 

5.82e-10 

5.96e-08 

09 

7.31e-12 

3.05e-05 

L04e-03 

1.67e-H00 

010 

2.15e-ll 

3.27e-ll 

5.53e-10 

1.19e-07 

CSPl 

2.43e-ll 

1.53e-05 

5.19e-04 

8.36e-01 

CSP2 

4.52e-12 

1.53e-05 

5.19e-04 

8.36e-01 

CSP3 

1.98e-ll 

1.53e-05 

5.19e-04 

8.36e-01 
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Given  that  a  transient,  single  bit  error  would  be  difficult  to  pinpoint  on  an  image 
intensity  plot,  a  much  more  catastrophic  error  was  chosen  to  demonstrate  the  fault- 
tolerance  of  the  beamformer.  In  case  of  a  real  processor  failure,  a  sonar  operator 
might  expect  that  a  component  failure  has  occured  somewhere.  In  either  case,  the 
output  would  be  erroneous.  To  simulate  this  type  of  error,  the  output  of  processor 
6  was  replaced  with  all  zeros.  Figure  4.14  shows  the  faulty,  or  corrupted,  output 
alongside  the  output  corrected  on  the  fly  after  the  computation  of  every  N3  =  34 
output  points. 

A  final  option  on  the  program  enables  the  injection  of  a  clipping  error,  where 
it  is  assumed  that  the  most  significant  bits  in  the  output  registers  of  a  processor 
have  failed,  clipping  the  output  at  a  certain  level.  Whereas  the  dramatic  error  of 
figure  4.14  may  be  easy  to  recognize  because  it  essentially  paints  a  stripe  down  a 
would  be  operator’s  display,  a  clipping  error  does  not  remove  background  noise,  but 
may  be  enough  to  remove  any  peak  returns.  Such  an  error  would  be  much  more 
difficult  for  an  operator  to  catch,  and  further  illustrates  the  value  of  the  system. 


4.5  System  challenges 

The  original  prototype,  built  to  test  the  concepts  of  MISF  fault-tolerance,  had  a 
number  of  constraints  due  to  the  use  of  off-the-shelf  components  [2].  Naturally,  by 
utilizing  the  same  hardware  the  same  constraints  existed  and,  in  addition,  several 
programming  challenges  appeared  throughout  the  development  of  the  beamformer. 

Although  the  DSP32C’s  are  themselves  very  fast,  they  are  constrained  by  the  16- 
bit,  iK-word  FIFO’s  on  the  VE-32C  boards,  which  limit  the  batch  size,  and  hence 
the  data  rate.  As  a  result,  with  —  15  sensors  per  channel,  only  34  complex  data 
samples  can  be  sent  for  each  sensor  with  a  given  batch.  With  the  addition  of  the 
extra  buffer  space  needed  in  external  memory  on  the  VE-32C’s  for  each  DSP32C  to 
perform  overlap  save  convolution,  the  overall  length  of  the  complex  data  buffers  is 
42,  of  which  8  points  (9-f  6  =  8)  must  be  cycled  after  each  batch.  Hence,  nearly  20% 
of  e2w:h  sensor’s  data  buffer  must  be  cycled  before  new  data  may  be  read  in.  This 
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reduces  the  efficiency  of  the  beamformer. 

An  additional  problem  with  the  VE-32C  architecture  is  that  only  one  DSP32C 
per  board  can  read  data  from  the  input  FIFO  at  a  time.  The  placement  of  the 
working  processors  and  the  checksum  processors  in  figure  4.6  is  capable  of  overcoming 
this  problem  when  configured  as  a  MISF  fault-tolerant  architecture.  In  this  case, 
advantage  is  taken  of  the  zero  weights;  if  weight  iWj.jt  =  0,  then  checksum  processor  j 
does  not  need  to  read  working  processor  k's  input.  This  elegant  layout  is  not  possible 
using  SIMF  fault-tolerance,  however,  since  all  processors  receive  the  same  input.  In 
this  case,  the  same  input  must  be  transmitted  four  times.  Many  problems  were 
encountered  in  the  initial  development  of  the  MISF  fault-tolerant  prototype  when 
attempting  to  read  from  and  write  to  the  FIFO’s.  It  was  found  that  these  problems 
resulted  most  directly  from  a  lack  of  synchronization,  more  precisely  when  a  DSP 
tried  to  read  from  a  FIFO  at  the  same  time  another  was  writing  to  it.  To  combat 
these  problems,  the  68030  microprocessor  host  was  set  up  to  control  all  transfers. 

Several  programming  problems  appeared  throughout  the  implementation.  The 
DSP32C’s  come  with  an  application  library  complete  with  several  functions,  though 
several  of  them  could  not  be  used.  The  version  of  the  DSP32C’s  used  for  this  applica¬ 
tion  had  a  problem  utilizing  a  particular  two-way  communication  link  between  them 
and  the  68030  host.  The  parallel  I/O  interrupt  register  (PIR)  is  set  up  to  “flag”  the 
DSP’s  when  an  operation  must  be  performed.  Unfortunately,  the  DSP’s  were  capable 
of  writing  to  this  register,  but  could  not  read  from  it.  This  made  it  impossible  to  use 
some  of  the  subroutines  in  the  application  library,  including  .fire  used  to  implement  a 
complex  FIR  filter.  To  overcome  this  communications  problem,  a  section  of  memory 
was  reserved  for  sending  and  receiving  commands  between  the  host  and  the  DSP’s. 
In  addition  to  these  problems,  the  compiler  used  for  the  DSP32C’s  was  an  earlier 
version,  and  had  a  relatively  small  symbol  table.  Errors  resulting  from  exceeding  the 
size  of  this  table  were  particularly  difficult  to  catch,  and  required  the  programs  to  be 
split  into  smaller,  separately  compiled  pieces. 
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4.6  Theoretical  overhead 


In  an  idealized  system  with  no  extraneous  overheads  for  memory  access,  communica¬ 
tion,  synchronization,  or  scheduling,  it  is  not  difficult  to  calculate  the  percentage  of 
overhead  dedicated  to  fault- tolerance  [2,  8]. 

Using  weight  matrix  (2.12)  with  a  batch  of  p  real  data  points  per  hydrophone,  cal¬ 
culation  of  the  syndromes  requires  p(7V-|-C)C7  =  39p  multiply-adds.  Using  symmetry, 
computation  of  the  cross-correlations  requires  p(C  -|-l)C/2  =  6p  multiply-adds.  Sum¬ 
ming  the  auto-correlations  to  test  for  failure  accounts  for  2  adds,  and  each  likelihood 
requires  —  1  =  S  additions  and  1  scaling.  In  addition,  there  are  13  comparisons 
between  the  likelihoods.  Correcting  the  output,  in  theory,  requires  up  to  Cp  =  3p 
additions,  and  p  scalings.  The  beamformers  implement  a  complex  FIR  filter  on  com¬ 
plex  data  prior  to  summing,  and  hence  require  qNep  =  120p  multiply-adds,  and 
{Ne  —  \)p  =  14p  additions.  There  are  C  =  3  processors  forming  redundant  beams. 
Since  the  GLRT  computations  are  not  protected  by  this  configuration,  they  would 
have  to  be  performed  in  triplicate.  The  ratio  of  work  done  by  the  N  =  \0  processors 
is  thus  about: 

A  ,  24p-h3(6p-H3-9-f  15-}-4p) 

10  10  •  134p  ^  ’ 

Because  beamforming  is  very  computationally  intensive,  the  theoretical  overhead  is 
fairly  minimal.  In  fact,  for  p  =  2Ng  =  68,  the  overhead  is  a  mere  36%. 


57 


Chapter  5 


Conclusions  and 
Recommendations  for  Future 
Work 


The  purpose  of  this  multidirectional  sonar  beamformer  implementation  was  to  utilize 
the  principles  of  multiple  input,  single  function  fault-tolerance,  and  apply  them  to 
a  different  configuration  consisting  of  a  single  input  with  multiple  functions  for  an 
actual  application.  This  application  was  then  to  demonstrate  that  the  usefullness  of 
analytic  fault-tolerance  is  indeed  much  greater  than  previously  acknowledged,  as  it 
clearly  did.  In  addition,  this  application  opened  the  window  to  a  more  generic  use  of 
analytic  fault-tolerance,  where  entirely  different  applications  may  be  joined  to  reduce 
the  overhead  necessary  for  error  detection  and  correction.  Combined  with  earlier 
results  [2],  the  results  from  this  thesis  show  that  the  maximum  likelihood  algorithm 
of  section  2.2  can  be  used  to  protect  a  combination  of  SIMF  and  MISF  configurations. 

The  digital  interpolation  beamformer  chosen  for  this  application  is  not  closed  to 
improvements.  Several  of  the  shorter  subroutines  could  be  replaced  with  more  efficient 
assembler  macros,  and  the  C-code  itself  has  not  been  optimized.  The  beamformer  ar¬ 
chitecture  implemented  here  itself  is  not  fault-tolerant,  but  serves  to  demonstrate  the 
principles  of  SIMF  fault-tolerance.  In  actuality,  if,  say,  one  of  the  VE-32C  boards  were 
to  fail,  that  would  account  for  4  processor  failures,  which  the  current  configuration 
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could  not  handle. 

As  a  topic  of  future  research,  the  concept  of  analytic  fault  tolerance,  either  for 
MISF  or  SIMF  (or  a  combination  of  both),  may  be  extended  to  nonlinear  systems 
through  the  use  of  homomorphic  analysis  [10]  or  linearization,  for  example.  As  a 
further  extension,  then,  these  nonlinear  systems  may  be  combined  in  a  particular  con¬ 
figuration  with  linear  systems,  offering  a  very  broad  range  of  analytic  fault-tolerance. 
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Appendix  A 
Source  Code 


Various  programs,  subprograms,  and  header  files  were  written  for  the  DSP32C’s  and 
the  68030  host.  The  usage,  along  with  any  dependencies,  are  listed  in  table  A.l  for 
clarity  with  the  actual  source  code  following. 


Table  A.l:  Source  files  listed  by  processor,  with  dependencies 


file 

processor(s) 

dependencies  and/or  include  files 

genb.c 

input 

genb32c.h,  libb.h 

mod_genb.c 

input 

dsp32c.h 

beam.c 

beam 

filter.c,  dsp32c.h 

csp.c 

csp 

filter.c,  dsp32c.h 

filter.c 

beam,  csp 

my  .macros,  h 

write.coefX.c 

beam 

dsp32c.h,  libb.h, 
fircoef.h,  myP.h 

write.cspZ.c 

csp 

dsp32c.h,  libb.h, 
fircoef.h,  myP.h 

beam_mle.c 

mle 

dsp32c.h 

write.weights.c 

mle 

dsp32c.h 

beamtrans.c 

68030 

rw.c,  util.c,  dspstruct.h, 
macros. h,  scrn.h 

dsp32c.h 

beam,  csp, 
input,  mle 

myjnacros.h 

Because  they  were  written  prior  to  work  on  this  application  began  to  facilitate 
memory  access  and  debugging,  rw.c  and  util.c  are  not  listed  in  this  appendix.  In 
addition,  genb32c.h,  fircoef.h,  and  myP.h  are  not  listed.  The  former  differs  from 
dsp32c.h  by  a  single  constant  redefinition,  and  the  latter  two  merely  contain  the  ma¬ 
trices  of  subfilter  coefficients,  and  whole  integer  delays  for  each  sensor  in  all  channels, 
respectively. 
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/«  GENB.C  -  generates  data  for  other  processors 

This  starts  by  waiting  for  the  host  to  send  Ns  via  COMBUF. 

It  generates  a  block  of  2Ne’i‘Ns  samples  of  fixed-point  complex  data 
(in  external  memory,  so  the  68030  can  examine  it  if  necessary). 

It  then  waits  for  a  GO.XMIT  from  the  68030,  eind  sends  the  data, 

1  block  at  a  time,  to  its  output. 

It  then  loops  back  and  repeats  forever. 

Acknowledges  are  returned  to  the  68030  via  PIR  as  soon  as  possible. 

COMMAND  BUFFER  USE: 

COMBUFO:  commands 

COMBUFl:  Ns  (#  complex  data  samples/sensor  to  generate  ) 

C0MBUF2:  most  recent  error  code 

C0MBUF3:  count  of  sets  of  packets  computed  so  far 

C0MBUF4:  count  of  packets  transmitted  so  far 

C0MBUF5:  b  (max  #  whole  integer  delays  in  working  channels) 


#include  "genb32c.h" 
tinclude  <libb.h> 

/♦  Define  Communications  buffer  assignments  */ 

#define  COM.CMD  0 
#define  COM.Ns  1 
#define  COM.ERR  2 
#define  COM.SET  3 
#define  COM.PACK  4 
#define  COM.b  5 

extern  void  put_PIR();  /♦  external  subroutine  to  comm,  with  68030  */ 
#define  Theta  0  /♦  steered  angle  off  broadside  ♦/ 

mainO 

int  count , Ns, Nout; 
register  int  i,j,nes; 

register  unsigned  short  ♦Port , command, *Combuf ; 
register  short  int  ♦In,*Ptr; 
float  cond,ti,tq,tau,tht; 
register  float  s_arg; 

/♦  Initialize  buffer  pointers  ♦/ 

In  =  (short  int  ♦)IBUF; 

Port  *  (unsigned  short  *)P0RT; 

Combuf  =  (unsigned  short  ♦) COMBUFO; 
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/♦  Initialize  PIR,  COMBUF  areas  ♦/ 
put_PIR(0) ; 

Combuf [C0M_CMD]  =  0; 

Combuf [COM.Ns]  =  0; 

Combuf [COM.ERR]  =  0; 

Combuf [C0M.SET]  =  0; 

Combuf [COM.PACK]  »  0; 

Combuf [COM.b]  =  0; 

Nout  *  0; 

Ns  =  0; 

tht  =  (float) (Theta*M_PI/ 180); 
nes  =  (int) (Ne/2) ; 
count  =  5; 

/*  Enter  the  main  loop,  waiting  for  commands  from  the  host  */ 
for(; ;){ 

I*  Check  if  new  data  is  needed  *! 
if (count  >  4)  { 

Ptr  =  In; 

for(count  =  1;  count<®4  ;  count++)  { 
i  ®  Ns; 

while (i — >0)  {  /♦  condition  the  data  ♦/ 

if  (mod((float)(LS*(Ns-l-i)),2.0)==0.0) 
cond  =  1.0; 
else 

cond  =  -1,0; 

/♦  quadrature  sampled  1st  ♦/ 
tq  =  (float) ((Ns-l-i)*DELTA); 

/*  in-phase  sampled  2nd  ♦/ 

ti  *  (float)(tq-  (1 .0/(4.0*Fc))); 

for  (j  =  -nes;  j  <=  nes;  j++) 

s_arg  =  (float) ((ASPC*j)-tht-M_PI_2) ; 
tau  ®  (float) (-mod_sin(s_arg)*WRAD/C) ; 
s_arg  ®  -mod_sin((float) (W0*(ti-tau))) ; 

*Ptr++  =  (short  int) (2048.0  ♦  s_arg  ♦  cond);  /♦  in-phase  */ 
s_arg  =  mod_sin((float)(W0*(tq-tau))) ; 

♦Ptr++  =  (short  int) (2048.0  ♦  s.arg  ♦  cond);  /*  quad.  */ 

} 

} 

} 

count  ®  1; 

Ptr  ®  In; 
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if (Ns  >  0)  { 

Combuf [C0M.SET3++; 

} 

> 


/*  Main  dispatch  loop  -  wait  for  command  in  COMBUFO  ♦/ 
for(;;)  { 

/♦  Extract  command  code,  dispatch  to  handler  ♦/ 

do{}  while ((command  =  *Combuf)  0  1 1  command  ==  ACK_ERR) ; 
i  =  (command  k  OxffOO) ; 

♦Combuf  =  0; 

/*  Do  the  dispatch  */ 
if(i  ==  GO.N)  { 

Ns  =  (int)  Combuf [COM.Ns] ; 

Nout  =  (int) (2*Ns*Ne) ; 
put_PIR(command) ; 
count  =  5; 
break; 

}  else  if(i  ==  GO.XMIT)  { 
for  (3*0;  3++  <  Nout;) 

♦Port  *  (short  int) (♦Ptr+t) ; 
put_PIR(command) ; 

Combuf [COM.PACK] ++; 
if(count++  ==  4) 
break; 

}  else  { 

Combuf [C0M_ ERR]  =  command; 
put_PIR(ACK.ERR) ; 

} 

} 

} 

} 

/♦  M0D_GENB.C  -  transmits  data  for  other  processors 

This  starts  by  waiting  for  the  host  to  send  Ns  via  COMBUF. 

It  then  waits  for  a  GO.XMIT  from  the  68030,  and  sends  the  data, 

1  block  at  a  time,  to  its  output. 

It  then  loops  back  and  repeats  forever. 

Acknowledges  are  returned  to  the  68030  via  PIR  as  soon  as  possible. 
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COMMAND  BUFFER  USE: 

COMBUFO:  commands 

COMBUFl:  Ns  (#  complex  data  samples/sensor  to  generate  ) 
C0MBUF2:  most  recent  error  code 

C0MBUF3:  count  of  sets  of  packets  computed  so  far 

C0MBUF4:  count  of  packets  trzuismitted  so  far 

C0MBUF5:  b  (max  #  whole  integer  delays  in  working  channels) 


#include  "dsp32c.h" 

/♦  Define  Communications  buffer  assignments  ♦/ 

#define  COM.CMD  0 

#define  COM.Ns  1 

#define  CQM.ERR  2 

#define  CQM.SET  3 

#define  COM.PACK  4 

#define  C0M_b  5 

extern  void  put_PIR() ; 

mainO 

int  batch, count , Ns, Nout; 
register  int  i,j; 

register  unsigned  short  ♦Port , command, *Combuf ; 
register  short  int  ♦In,*Ptr; 

/♦  Initialize  buffer  pointers  ♦/ 

In  =  (short  int  ♦)IBUF; 

Port  =  (unsigned  short  *)P0RT; 

Combuf  =  (unsigned  short  ♦) COMBUFO; 

/♦  Initialize  PIR,  COMBUF  areas  */ 
put_PIR(0) ; 

Combuf [COM.CMD]  =  0; 

Combuf [COM.Ns]  =  0; 

Combuf [COM.ERR]  =  0; 

Combuf [COM.SET]  =  0; 

Combuf [COM.PACK]  =  0; 

Combuf [COM.b]  =  0; 

Nout  =  0; 

Ns  =  0; 
count  ■  1; 
batch  =  -1; 


66 


/♦  Enter  the  main  loop,  waiting  for  commands  from  the  host  */ 
for(; ;){ 
count  =  1; 
if (Ns  >  0)  { 
batch  +=  1; 

Combuf [COM.SET] ++ ; 

} 


/*  Main  dispatch  loop  -  wait  for  command  in  COMBUFO  ♦/ 
for(;;)  { 

/♦  Extract  command  code,  dispatch  to  handler  ♦/ 

do{}  while ((command  =  ♦Combuf)  ==  0  I  I  command  =*  ACK_ERR) ; 
i  =  (command  ft  OxffOO) ; 

♦Combuf  =  0; 

/♦Do  the  dispatch  */ 
if(i  ==  GO.N)  { 

Ns  =  (int)  Combuf [C0M_Ns] ; 

Nout  =  (int) (2^Ns^Ne) ; 
put_PIR(command) ; 
break; 

}  else  if(i  =«  GO.XMIT)  { 

Ptr  =  In  +  batch^Nout; 
for  (j=0;  j++  <  Nout;) 

♦Port  =  (short  int) (♦Ptr++) ; 
put_PIR (command) ; 

Combuf CC0M_PACK]++; 
if(count++  ==  4) 
break; 

)•  else  { 

Combuf [COM.ERR]  =  command; 
put_PIR(ACK.ERR) ; 

} 

} 

} 

> 

/♦  BEAM.C(CSP.C)  -  Computes  a  beam  response  of  the  input  data 
in  the  working (checksum)  processors,  sends  to  output. 

This  starts  by  waiting  for  the  host  to  send  Ns,  b  via  the  COMBUF. 
It  waits  for  G0_RCV  from  the  host,  then  reads  multiple  blocks  of 
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2Ne  samples  of  fixed-point  data  from  its  input  FIFO.  (Puts  data 
in  external  memory,  so  the  68000  can  examine  it  if  necessary) . 

It  converts  to  floating  point,  interpolates,  then  finds  the 
on-aucis  value. 

If  it  is  chosen  as  the  faulty  processor,  it  then  injects  the  const, 
error  into  the  output  buffer. 

It  performs  overlap-save  convolution  by  maintaining  a  circular 
data  buffer. 

It  then  waits  for  a  GO.XMIT  from  the  68000,  and  sends  the  data  in 
floating  point  format,  16-bits  at  a  time,  to  its  output  FIFO. 

It  then  loops  back  and  repeats  forever. 

Acknowledges  are  sent  to  the  68000  as  soon  as  possible 

COMMAND  BUFFER  USE: 

COMBUFO  :  commands 

COMBUFl  :  Ns  (#  complex  data  samples/sensor  for  each  batch) 

C0MBUF2  :  last  error  command 

C0MBUF3  :  count  of  sets  of  packets  computed  so  far 
C0MBUF4  :  fault  type,  sent  from  host 

C0MBUF5  :  b  (max  #  of  whole  integer  delays  in  working  channels) 

C0MBUF6  :  1  if  faulty  processor,  0  otherwise 

C0MBUF7  :  failure  sample,  sent  from  host 

COMBUFOf:  real  fault  value 

COMBUFlf :  imaginary  fault  value 

♦/ 

#include  "dsp32c.h" 

#include  <libb.h> 

/♦  Define  communications  buffer  assignments  */ 

#define  C0M_Ns  1 
#define  COM.ERR  2 
#define  COM.SET  3 
#define  COM.FAULT  4 
#define  COM.b  5 
#define  COM.PROC  6 
#define  COM.SAMP  7 
extern  void  put_PIR(); 

mainO 

{ 

short  int  errproc,  errflag,  samp; 

int  Ns,  b,  nes,  maxlen,  pt,  pad,  Nout,  points; 

register  unsigned  short  int  *Port, command, *Combuf; 
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register  short  int  In,  ♦beam.io; 
register  int  start,  n,  m; 

COMPLEX  ♦Data,  *BeaiB,  ♦magn; 
float  *Mag,  *freal,  ♦fimag; 

/♦  Initialize  pointers  ♦/ 

Port  =  (unsigned  short  int  ♦)P0RT; 

Data  =  (COMPLEX  ♦)IBUF; 

Beam  =  (COMPLEX  ♦)BEAM; 
magn  =  (COMPLEX  ♦)BBUF; 

Mag  =  (float  ♦)MBUF; 
freal  *  (float  ♦) (COMBUFOf ) ; 
fimag  *  (float  ♦) (COMBUFlf ) ; 

Combuf  =  (unsigned  short  ♦)C0MBUF0; 

/*  Initialize  PIR,  COMBUF  areas  */ 
put.PIR(O) : 

for  (n=0;  n<12;  n++) 

Combuf [n]  =  0; 

errflag  *  samp  =  0; 
nes  *  (int) ((int) (Ne)/2) ; 

errproc  =  Nout  =  maxlen  =  points  =  start  =  pad  =  0; 

/*  Main  dispatch  loop  -  wait  for  commauid  in  COMBUFO  */ 
for(;;)  { 

/♦  Extract  command  code,  dispatch  to  handler  */ 
do{}  while ((command  =  *Combuf)  ==  0  | I  commemd  ==  ACK_ERR) ; 
m  =  (command  ft  OxffOO) ; 

♦Combuf  =  0; 

/♦  New  Value  of  Ntot  ♦/ 
if(m  ==  GO.N)  { 

Ns  *  (int)  Combuf [COM.Ns] ; 
b  =  (int)  Combuf [C0M_b] ; 
errproc  =  Combuf [COM.PROC] ; 
errflag  =  Combuf [C0M_FAULT] ; 
saunp  =  Combuf  [COM.SAMP]  ; 

Nout  =  (int)(Ns^4); 
pad  =  (int) (ORDER  +  b) ; 
maxlen  =  (int) (Ns  +  pad); 
if  (Combuf [COM.SET]  ==  0)  { 
points  =  (int) (Ns); 
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start  =  (int) (pad) ; 

for  (m  =  0;  m  <  pad;  m++)  { 

Beam [m] .real  «  o.O; 

Beam[m] .imag  ^  0.0; 

} 

} 

put_PIR(command) ; 

/*  Receive  and  process  new  packet.  Convert  to  Floating  Point  */ 

}  else  if(m  ==  GQ.RCV)  { 
for  (m  =  0;  m  <  Ns;  m++)  { 
for  (n  =  0;  n  <  Ne;  n++)  { 

pt  =  (int) (n*maxlen  +  pad  +  m) ; 

In  =  (short  int) (♦Port) ; 

Data [pt] .real  =  (float)(In); 

In  =  (short  int) (♦Port) ; 

DataCpt] . imag  =  -(float) (In) ;  /♦  X  =  Xr  -  jXi  ♦/ 

} 

} 

/♦  Digital  Interpolation  Beamforming,  Data  Cycling  ♦/ 

for  (n  =  0;  n  <  Ne;  n++)  {  /♦  Interp.  k  Form  Beam  ♦/ 

pt  =  (int) (n^maxlen) ; 

f ilter(4Data[pt+start] ,ftBeam[start] , points, n,b,Combuf [COM.SET]) ; 
for  (m  =  0;  m  <  pad;  m++)  /♦  Cycle  data  ♦/ 

DataCpt]  =  Data[Ns+(pt++)] ; 

} 

if  (errproc  >  0)  { 

for  (m  =  0;  m  <  Ns;  m++)  < 

if  ((errflag==l)  I  I  ((samp==m)  kk  (errflag==3)))  { 
magn[m].real  +=  ♦freal;  magn[m].imag  +=  ♦fimag; 

}  else  if  (errflag=®2)  { 

magn[m].real  =  ♦freal;  magn[m].imag  =  ♦fimag; 

} 

} 

} 

/♦  Subsequent  filters  over  all  maxlen  pts.  ♦/ 
points  =  (int) (maxlen) ; 
start  *  0; 
put_PIR(command) ; 

Combuf [C0M_SET]++; 

/♦  Transmit  finished  packet,  16  bits  at  a  time  ♦/ 

}  else  if(m  ==  GO.XMIT)  { 
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beam.io  =  (short  int  ♦) (BBUF) ; 

♦Mag  =  0.0; 

for  (iB=0;  m  <  Ns;  m++)  { 

♦Mag  +=  (float) ((magnCm] .real  ♦  magn [m] . real  + 

magn [m] . imag  ♦  magnCm] . imag)/Ns) ; 
for  (n=0;  n  <  4;  n++) 

♦Port  =  ♦beaiQ_io++; 

} 

put_PIR(coinmand)  ;  /♦  acknowledge  quickly  ♦/ 

}  else  { 

Combuf [COM.ERR]  =  command; 
put.PIR(ACK_ERR) ; 

> 

} 

} 

^include  "my_macros .h" 

void  f ilter( in, out, N, sensor, b, batch)  /♦  subroutine  to  implement  ♦/ 
COMPLEX  ♦in,  ♦out;  /♦  complex  FIR  filter.  ♦/ 

int  N,  sensor,  b; 
unsigned  short  batch; 

COMPLEX  ♦fir,  ♦coef,  sum,  ♦data  *  in; 
register  int  t,  k,  n  =  0; 
int  step; 

step  =  (int) (FILTBUF  +  sensor^(b+l)^LEN^8) ;  /♦  8  bytes/complex  pt.  ♦/ 
fir  =  (COMPLEX  ♦)(step); 

do  { 

if  (sensor  ==  0)  •{ 

out->real  ®  0.0;  /♦  set  output  to  0  if  1st  hydrophone  */ 

out->imag  =  0.0; 

} 

in  =  data  +  n;  /♦  . . .reset  data  ptr  ♦/ 

k=  (int) (min(n, (ORDER+b))) ; 
t  =  (int) (max(0, (n-ORDER))) ; 
sum. real  =  0.0;  sum. imag  =  0.0; 
if  (n  <  (ORDER+b)) 
do  { 

if  ((k  >*  b)  II  (batch  >0)) 
coef  =  fir  +  b^LEN  +  (n-k) ; 
else 
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} 


coef  *  fir  +  k*LEN  +  (n-k) ; 
sum. real  +=  (float) (((in->real  ♦ 

(in->imag  * 
sum.imag  +=  (float)  (((iii->real  ♦ 

(in->imag  * 


coef->real)  - 
coef->imag))) ; 
coef->imag)  + 
coef->real))) ; 


in--; 

}  while  (k —  >  t) ; 
else  { 

coef  *  fir  +  b*LEN; 
do  { 

sum. real  +=  (float) (((in->reaJ.  *  coef->real)  - 

(in->imag  ♦  coef->imag))) ; 
sum.imag  +=  (float) (((in->real  *  coef->imag)  + 

(in->imag  ♦  coef->real))) ; 


in — ;  coef++; 

}  while  (k —  >  b) ; 

} 

out->real  +=  (float) (sum. real/Ne) ;  /♦  normalize  by  Ne  ♦/ 
out++->imag  +=  (float) (sum. imag/Ne) ; 

}  while  (++n  <  N) ; 


/♦  WRITE.COEFX.C  :  Computes  complex  filter  coefficients  needed  for 

working  processor  #X,  and  writes  to  external 
memory  to  be  used  by  BEAN.C 

*/ 


#include  "dsp32c.h" 

#include  <libb.h> 

#include  "fircoef.h"  /*  matrix  of  subfilter  coefficients  ♦/ 
#include  "myP.h"  /♦  matrix  of  whole  integer  delays  ♦/ 


/♦  Define  communications  buffer  assignments  */ 

#define  T  X  /♦  selects  look  angle  #X:  theta  =  -22.5  +  T*5  */ 
mainO 

int  diff,  step,  nes,  Q,  M,  M_max; 

register  int  j,  n,  m; 
register  float  tau,  arg; 

COMPLEX  ♦filt,  ♦Fir,  ♦Coef,  phase; 
float  mx.  Theta; 
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/♦  Initialize  pointers  */ 

Coef  =  (COMPLEX  *)FILTBUF; 

Theta  =  (float) (theta[T]*M.PI/180) ; 
nes  ®  (int) ((int) (Ne)/2) ; 
step  =  (int) (b  +  1) ; 

mx  =  (float) (ASPC*nes)  +  (float) (abs (Theta))  -  (float) (M_PI_2) ; 

=  round ( (float ) (WRAD*mod_s in (mx)/(C*DEL))) ; 
for  (n  *  0;  n  <  Ne;  n++)  { 

Fir  =  Coef  +  n*step*LEN; 

arg  =  (float) ((n  -  nes)*ASPC  -  Theta  -  M_PI_2); 
tau  =  (f loat) ((WRAD/C)*nod_sin(arg)) ; 
arg  =  (float) (W0*tau) ; 

phase. real  =  -mod_sin(arg-M_PI_2) ;  /*  =  cos (arg)  */ 

phase. imag  =  mod_sin(arg) ; 

M  =  M_max  -  round (tau/DEL) ; 

if  ((Q  =  (int) ((P[T] [n]+l)*L  -  M))  ==  (int)(L)) 

q  =  0; 

for  (j  =  0;  j  <  step;  j++)  { 
filt  =  Fir  +  j*LEN; 
diff  *  (int)(j  -  P[T][n]); 
if  ((diff  >=  0)  kk  (diff  <=  b))  < 
for  (m  =  0;  m  <  LEN;  m++)  { 

filt->real  =  (f loat) (f ircoef [Q] [m]  *  phase. real); 
filt->imag  =  (f loat) (f ircoef [Q] [m]  ♦  phase. imag); 
f ilt++; 

} 

} 

else  { 

for  (m  =  0;  m  <  LEN;  m++)  ■( 
filt->real  =  0.0; 
filt->imag  =  0.0; 
f ilt++; 

} 

} 

} 

> 

} 

/♦  WRITE_CSPZ.C  :  calculates  the  summed  complex  filter 

coefficients  for  CSP  #Z,  and  writes 
them  to  external  memory  for  use  by  CSP.C 

*/ 

#define  N  10  /*  number  of  working  channels  */ 
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/*  weights  used  for  Zth  CSP  ♦/ 

int  wt[N]  “  {1,  1,  1,  1,  0,  0,  1,  1,  1,  1}; 

#include  "dsp32c.h" 

#include  <libb.h> 

•include  "fircoef.h"  /♦  matrix  of  subfilter  coefficients  */ 

•include  "myP.h"  /♦  matrix  of  whole  integer  delays  ♦/ 

mainO  { 

int  nes,  Q,  M,  M.max,  step,  diff; 
register  int  j,  k,  m,  n; 
register  float  tau,  arg; 

COMPLEX  *fir,  ♦coef,  ♦filt,  phase; 
float  mx,  tht; 

coef  =  (COMPLEX  ♦)FILTBUF; 
nes  =  (int) ((int) (Ne)/2) ; 
step  =  (int)(b  +  1); 

for  (n  =  0;  n  <  Ne;  n++)  { 
fir  =  coef  +  n*step*LEN; 
for  (k  =  0;  k  <  N;  k++)  { 

tht  *  (float) (theta [k]  ♦  M.PI/180); 

mx  *  (float) (ASPC^nes)  +  (float) (abs(tht))  -  (float) (M_PI_2) ; 

M.majt  =  round((float) (WRAD*mod_sin(mx)/(C*DEL))) ; 

arg  =  (float) ((n  -  nes)*ASPC  •  tht  -  M.PI_2) ; 

tau  =  (float) ((WRAD/C)*mod_sin(arg)) ; 

arg  =  (float) (W0*tau) ; 

phase. real  =  -mod_sin(arg-M_PI_2) ;  /*  =  cos (arg)  ♦/ 

phase. imag  =  mod_sin(arg) ; 

M  =  M_mauc  -  round  (tau/DEL) ; 

if  ((Q  =  (int) ((P[k] [n]  +  1)*L  -  M))  ==  (int)(L)) 

q  =  0; 

for  (j  =  0;  j  <  step;  j++)  { 
filt  =  fir  +  j*LEN; 
if  (k  ==  0)  { 

for  (m  *  0;  m  <  LEN;  m++)  { 

filt->real  =  0.0;  filt->imag  =  0.0; 
f ilt++; 

} 

filt  =  fir  +  j*LEN; 

}  /*  END  if  (k  ==  0)  ...  ♦/ 

diff  =  (int)(j  -  P[k][n]); 
if  ((diff  >=  0)  tt  (diff  <=  b)) 
for  (m  =  0;  m  <  LEN;  m++)  { 

filt->real  +=  (float) (fircoef [Q] [m]  ♦  phase. real  *  wt[k]); 
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filt->imag  +=  (float) (fircoef[Q] [m]  *  phase. imag  *  wt[k]); 
filt++; 

}; 

}  /*  END  for  (j  ■  0;  j  <  step;  j++)  . . .  ♦/ 

}  /*  END  for  (k  =  0;  k  <  N;  k++) 

>  /*  END  for  (n  *  0;  n  <  Ne;  ii++)  . . .  ♦/ 

}■  /*  END  main  ♦/ 

/♦  BEAM.MLE.C  -  Computes  syndromes,  syndrome  cross-correlations, 
likelihoods,  picks  the  most  likely  failure  ...  and  corrects  it. 

(N  »  10,  C  =  3) 

This  starts  by  waiting  for  the  host  to  send  Ns  to  determine 
batch  size  via  CQMBUFl. 

It  collects  13  input  packets  from  the  FIFO.  For  each  packet,  it 
waits  for  G0_RCV  from  the  host,  then  reads  Ns  complex  floating¬ 
point  data  points  from  its  input  FIFO,  storing  them  in  IBUF. 

It  acciimulates  the  syndromes  as  the  packets  arrive,  scaling  by 
1/16  to  avoid  overflow. 

After  computing  the  syndromes,  it  calculates  the  syndrome 
cross-correlations,  storing  them  in  the  rho  variables. 

It  then  computes  the  likelihoods,  auad  picks  the  maximum. 

It  computes  a  threshold  set  to  1/2  the  expected  energy  for  a 
one-bit  error  on  the  input .  Depending  on  whether  or  not  the 
likelihood  is  larger  than  this,  it  corrects  the  error. 
Acknowledges  are  sent  to  the  68030  as  soon  as  possible. 

COMMAND  BUFFER  USE; 

COMBUFO:  commands 

COMBUFl:  Ns  (from  host:  #  complex  samples  per  hydrophone) 

C0MBUF2:  last  error  command 

C0MBUF3:  count  of  sets  of  packets  computed  so  far 

C0MBUF4:  count  of  packets  computed  so  far 

CDMBUF6:  Most  likely  failed  processor,  or  no  fault  (0) 

C0MBUF7:  Most  likely  failed  processor 

*/ 

♦include  ”dsp32c.h" 

/♦  Define  communication  buffer  assignments  ♦/ 

♦define  CDM.CMD  0 

♦define  COM.Ns  1 

♦define  COM.ERR  2 

♦define  COM.SET  3 

♦define  COM.PACK  4 
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tdefine  CON.DECIDE  6 
#define  CON.NLE  7 

extern  void  put_PIR() ; 
float  scale  =  1.0; 

mainO  { 

register  \insigned  short  int  ’•■Port .command, *Combuf; 
register  short  int  ♦Inptr; 
register  float  maixL,  ’•‘Mag; 

COMPLEX  ♦In,  *0ut,  *31,  *32,  *33; 

float  ♦R,  ♦Gamma,  ♦Wt,  ♦FixW,  ♦Rho,  ♦Likely,  test; 

register  int  i,  j,  k,  m; 

int  Ns,  count,  proc; 

void  accufflO,  innerprodO; 

/♦  Initialize  Pointers  ♦/ 

Port  =  (unsigned  short  int  ♦)P0RT; 

Combuf  =  (unsigned  short  int  ♦)C0MBUF0; 

In  =  (COMPLEX  ♦)IBUF; 

Out  =  (COMPLEX  ♦)CBUF; 
si  =  (COMPLEX  ♦)0BUF; 

Rho  =  (float  ♦)RBUF; 

Likely  =  (float  ♦)LBUF; 

Gamma  =  (float  ♦) (LBUF  +  13^4) ; 

Wt  =  (float  ♦)WBUF; 

R  =  (float  ♦)(WBUF  +  4^39); 

FixW  =  (float  ♦XWBUF  +  4^52) ; 

Mag  *  (float  ♦)MBUF; 

/♦  Initialize  PIR,  COMBUF  areas  ♦/ 
put_PIR(0) ; 

Combuf [COM.CMD]  »  0; 

Combuf [COM.Ns]  =  0; 

Combuf [COM.ERR]  =  0; 

Combuf [COM.SET]  =  0; 

Combuf [COM.PACK]  *  0; 

Ns  =  0; 
count  *  0; 
test  =  0.0; 

/♦  Main  Dispatch  Loop  —  wait  for  command  in  COMBUFO  ♦/ 
for(;;)  { 

/♦  extract  command  code,  dispatch  to  handler  ♦/ 
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do{}  while  ((command  =  *Combuf)  »=  0); 
m  =  (commemd  k  Oxf f 00) ; 

♦Combuf  *  0; 


/♦  New  Value  of  N  ♦/ 
if  (m  ==  GO.N)  { 

Ns  =  (int)Combuf [COM.Ns] ; 

put_PIR(command) ; 

s2  =  si  +  Ns;  s3  *  s2  +  Ns; 

/♦  Receive  and  process  new  packet  ♦/ 

}  else  if  (m  ==  GO.RCV)  { 
m  =  4*Ns; 

Inptr  =  (short  int  ♦) (IBUF  +  count*8*Ns) ; 
while  (m —  >  0) 

*Inptr++  ®  (short  int) ♦Port; 

Combuf [C0M.PACK1++; 
put.PIR (command) ; 
i  =  (int) (3^count) ; 
j  =  (int) (count^Ns) ; 

/*  Compute  the  syndromes,  ♦/ 

/♦  initializes  if  needed  ♦/ 
accum(-Wt[i] ,AIn[j] , si, Ns, count) ; 
accum(-Wt[++i] ,ftln[j] , s2, Ns, count) ; 
accum(-Wt[++i] ,ftln[j] ,s3, Ns, count) ; 

if  (++count  >=  13)  {  /♦  If  all  13  packets  here,  do  GLRT  ♦/ 

k  =  m  =  0; 

♦Gamma  =  (float) (1.0/ (scale%2448.0)) ; 

innerprod(sl,sl,ftRho[0] ,Ns) ;  /♦  calculate  the  rho's  ♦/ 

innerprod(sl,s2,4Rho[l] ,Ns); 

innerprod(sl,s3,4Rho[2] ,Ns); 

innerprod(s2,s2,ftRho[4] ,Ns); 

innerprod(s2,s3,*Rho[5] ,Ns); 

innerprod(s3,s3,4Rho[8] ,Ns) ; 

/♦  invoke  symmetry  —  ♦/ 

Rho[3]  =  Rho[l];  Rho[6]  =  Rho[2];  Rho[7]  =  Rho[5]; 

/♦  sum  of  autocorrelations  ♦/ 
test  =  (Rho [0] +Rho [4] +Rho [83 ) ; 
proc  =  0;  maxL  =  0.0; 
do  {  /♦  compute  the  likelihoods  ♦/ 

Likely [k]  *  0.0; 

for  (m  =  0;  m  <  9;  m++)  { 

i  =  3^k  +  m/3;  j  =  3^k  +  my,3; 

LikelyCk]  +*  Wt[j]^Wt[i3^Rho[m]^R[k] ; 
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} 

if  (Likely  [k]  >  maixL)  {  /♦  choose  faulty  proc.  */ 
maxL  -  Likely [k] ; 
proc  =  k+1; 

} 

}  while  (++k  <  13) ; 

if  (test  >  ♦Gaaima)  /♦  indicate  failed  proc . ,  if  any  ♦/ 

Combuf [COM.DECIDE]  =  (short  int)proc; 
else 

Combuf [COM.DECIDE]  =  0; 

Combuf [COM.MLE]  =  (short  int)proc; 

j  =  0; 

do  {  /♦  Correct  the  output  */ 

accum(FixW[^proc-l)*13+j] ,ftIn[j*Ns] .Out ,Ns, j) ; 

}  while  (++j  <  13) ; 

innerprod(0ut .Out .Mag.Ns) ; 

/♦  Avg.  mag" 2  of  corrected  output  */ 

♦Mag  =  (float) (♦Mag/Ns) ; 

count  =  0;  test  =  0.0; 

Combuf [C0M.SETJ++; 

put_PIR(G0_D0NE) ;  /*  Signal  that  Likelihoods  may  be  read  ♦/ 

} 

}  else  { 

Combuf  [COM.ERR]  =  comm2Lnd; 
put_PIR(ACK_ERR) ; 

} 

} 

} 

void  accum(weight. data. sum. N. flag) 
float  weight; 

COMPLEX  ♦data,  ♦sum; 
int  N.  flag; 

{ 

register  int  i  =  0; 
if  (flag  !»  0) 
flag  »  1; 
do  { 

sum [i] .real  =  (float) (sum[i] .real^f lag  +  data[i] .real^weight) ; 
sum[i].imag  =  (float) (sum[i] , imag^f lag  +  data[i] . imag^weight) ; 

}  while  (++i  <  N) ; 
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} 


void  iniierprod(vl,v2,out ,N) 

COMPLEX  ♦vl,*v2; 
float  *out; 
int  N; 

register  int  i  =  0; 

♦out  ®  0.0; 
do  { 

♦out  +=  ((vl[i] .real^v2[i] .real)+(v2[i] .imag*vl[i] .imag)) /scale; 
}  while  (++i  <  N) ; 

} 

/*  WRITE_WEIGHTS . C  :  stores  the  weight  matrix  at  WBUF  in  external 
memory  on  BEAM.MLE,  prior  to  running.  It  is  loaded  and  run  at 
the  same  time  as  write.coef*,  and  write.csp^. 

♦/ 


♦include  ''dsp32c.h" 

♦define  total  13 
♦define  check  3 
float  wt [check] [total]  = 

{■Cl.O,  1.0,  1.0,  1.0,  0.0,  0.0,  1.0,  1.0,  1.0,  1.0, -1.0,  0.0,  0.0}, 
{1.0,  1.0, -1.0, -1.0,  1.0,  1.0,  0.0,  0.0,  1.0, -1.0,  0.0, -1.0,  0.0}, 
{1.0, -1.0,  1.0, -1.0,  1.0, -1.0,  1.0, -1.0,  0.0,  0.0,  0.0,  0.0, -1.0}}; 
♦define  ONETHIRD  0.3333333333333333 
♦define  TWOTHIRD  0.6666666666666666 

float  rkk [total]  =  {ONETHIRD,  ONETHIRD,  ONETHIRD,  ONETHIRD,  0.5, 

0.5,  0.5,  0.5,  0.5,  0.5,  1.0,  1.0,  1.0}; 

/♦  The  following  matrix  is  used  when  correcting  faults  ♦/ 
float  fixW  [total]  [total]  *  { 

{0.,  -ONETHIRD,  -ONETHIRD,  ONETHIRD,  -TWOTHIRD,  0.,  -TWOTHIRD,  0., 
-TWOTHIRD,  0.,  ONETHIRD,  ONETHIRD,  ONETHIRD}, 

{-ONETHIRD,  0.,  ONETHIRD,  -ONETHIRD,  0.,  -TWOTHIRD,  0.,  -TWOTHIRD, 
-TWOTHIRD,  0.,  ONETHIRD,  ONETHIRD,  -ONETHIRD}, 

{-ONETHIRD,  ONETHIRD,  0.,  -ONETHIRD,  0.,  TWOTHIRD,  -TWOTHIRD,  0., 
0.,  -TWOTHIRD,  ONETHIRD,  -ONETHIRD,  ONETHIRD}, 

{ONETHIRD,  -ONETHIRD,  -ONETHIRD,  0.,  TWOTHIRD,  0.,  0.,  -TWOTHIRD, 
0.,  -TWOTHIRD,  ONETHIRD,  -ONETHIRD,  -ONETHIRD}, 

{-1.,  0.,  0.,  1.,  0.,  0.,-0.5,  0.5, -0.5,  0.5,  0.,  0.5,  0.5}, 

{  0.,-l.,  1.,  0.,  0.,  0.,  0.5, -0.5, -0.5,  0.5,  0.,  0.5, -0.5}, 

{-1.,  0.,-l.,  0.,-0.5,  0.5,  0.,  0. ,-0.5, -0.5,  0.5,  0.,  0.5}, 

{  0.,-l.,  0.,-l.,  0.5, -0.5,  0.,  0., -0.5, -0.5,  0.5,  0.,-0.5}, 
{-l.,-l.,  0.,  0. ,-0.5, -0.5, -0.5, -0.5,  0.,  0.,  0.5,  0.5,  0.}, 
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{  0.,  O.,-!.,-!.,  0.5,  0.5, -0.5, -0.5,  0.,  0.,  0.5, -0.5,  0.}, 

•C  1.,  1.,  1.,  1.,  0.,  0.,  1.,  1.,  1.,  1.,  0.,  0.,  0.}, 

{  1.,  1. 1.,  1.,  0.,  0.,  l.,-l.,  0.,  0.,  0.}, 

{  l.,-l.,  0.,  0.,  0.,  0.,  0.} 

}; 

mainO  { 

float  ♦weight,  ♦Rkk,  ♦Fix; 
register  int  i,  j; 

weight  =  (float  ♦)WBUF; 

Rkk  =  (float  ♦)(WBUF  +  4^39) ; 

Fix  =  (float  ♦) (WBUF  +  4^52) ; 

for  (i  =  0;  i  <  total;  i++)  { 

♦Rkk++  =  rkk[i] ; 
for  (j  =0;  j  <  total;  j++)  { 
if  (j  <  check) 

♦weight++  =  (float) (wt [j] [i]/16.0) ; 

/♦  write  weights  in  column  major  order  ♦/ 
♦Fix++  =  fixW[i][j]; 

/♦  write  fixW  in  row  major  order  ♦/ 

} 

} 

} 

/*****************************************************/ 

/♦  ♦/ 

/♦  DSP32c.h  ::  Include  file  for  the  DSP's  ♦/ 

/♦  ♦/ 

/******************************^i******it***************/ 

/♦  The  FIFO  port  address  ♦/ 

«define  PORT  OxACOOO 

/♦  Define  the  codes  for  communication  with  the  DSP32C's 
Upper  8-bits  =  code 
Lower  8-bits  =  transaction  number 

♦/ 

♦define  G0_N  0x0100 
♦define  GO.XMIT  0x0200 
♦define  GO.RCV  0x0300 
♦define  GO.DONE  0x0400 

♦define  ACK.ERR  Oxffff 
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/*  External  memory  data  buffer  addresses  */ 

/*  Define  the  communication  buffer  area  DSP32C  addresses  */ 

/*  Shorts  ♦/ 

•define  COMBUFO  OxffffeO 
•define  COMBUFl  0xffffe2 
•define  C0MBUF2  0xffffe4 
•define  C0MBUF3  OxffffeS 
•define  C0MBUF4  OxffffeS 
•define  CQMBUF5  Oxffffea 
•define  C0MBUF6  Oxffffec 
•define  C0MBUF7  Oxffffee 
/*  Floats  ♦/ 

•define  COMBUFOf  OxfffffO 
•define  COMBUFlf  0xfffff4 
•define  C0MBUF2f  OxfffffS 
•define  C0MBUF3f  Oxfffffc 

•include  "my .macros .h" 

•define  ASPC  0.0523598776  /♦  angle  between  elements  (radians)  ♦/ 

•define  Fc  200000.0  /*  carrier  frequency  in  Hz  */ 

•define  DELTA  5.00e-5  /♦  quad,  sampling  period  (sec)  ♦/ 

•define  DEL  7. 142857143e-6  /♦  interp.  samp,  period  (sec)*/ 
•define  WRAD  0.5729166667  /*  array  radius  in  feet  */ 

•define  C  5000.0  /*  speed  of  sound  in  feet/sec  */ 

•define  HO  1 .2566370614e+6  /*  Sonar  freq.  in  radians  ♦/ 

•define  IBUF  0x008000  /♦  Stores  incoming  quadrature  data  */ 

/*  These  defines  for  beam,  csp  channels  only  ♦/ 

•define  BBUF  0x00b040  /*  Beam  Output,  starts  at  BEAM  +  ORDER  */ 

•define  BEAM  OxOObOOO  /*  Interpolation  results  */ 

•define  MBUF  OxOOdSOO  /*  average  magnitude  over  results  ♦/ 

/*  These  defines  for  beam.mle  channel  only  ♦/ 

•define  OBUF  OxOOaOOO  /♦  Syndrome  buffers  in  beam.mle  ♦/ 

•define  RBUF  OxOObOOO  /*  Syndrome  cross-corr.  in  beam.mle  ♦/ 

•define  CBUF  OxOOcOOO  /♦  Corrected  proc.  output  in  beam.mle  */ 

•define  LBUF  OxOOdOOO  /♦  Likelihoods  and  thresh,  in  beam.mle  ♦/ 

•define  HBUF  OxOOeOOO  /♦  Heights  used  by  beam.mle  ♦/ 

float  theta[]  =  {-22.5,-17.5,-12.5,-7.5,-2.5,2.5,7.5,12.5,17.5,22.5}; 


/♦  MY.MACROS.H  */ 

•define  M.PI  3.14159265358979323846 
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♦define  M_PI_2  1.57079632679489661923 
♦define  M_1.PI  0.31830988618379067154 


♦define  L 

7 

/*  interpoltaion  ratio 

♦/ 

♦define  LEN 

9 

/*  length (PolyPhase  Filter) 

*/ 

♦define  ORDER 

8 

/*  LEN  -  1 

*/ 

♦define  Ne 

15 

/♦  number  of  hydrophones 

♦/ 

♦define  min(el,e2) 
♦define  meuc(el,e2) 
♦define  abs(el) 
♦define  sign(el) 


(float) (el)  <=  (float) (e2)  ?  (float) (el) : (float) (e2) 
(float) (el)  >=  (float) (e2)  ?  (float) (el) : (float) (e2) 
(float) (el)  >=  0.0  ?  (float) (el) : (float) (-el) 

(float) (el)  >=0.0  ?  (int) (l) : (int) (-1) 


♦define  FILTBUF  OxOOcOOO  /*  Filter  Coefficient  Storage  */ 


typedef  struct  { 
float  real; 
float  imag; 

}  COMPLEX ; 


/*  LIBB.H  ♦/ 

extern  float  mod(),  mod_sin(),  sqrt(); 
extern  int  roundO; 
extern  void  filterO  ; 


/*  BEAMTRANS.C  -  The  main  subroutine  which  coordinates  the  fault 
tolerance  application  for  N=10,  C=3. 

The  program  runbeam(start ,stop)  sequences  all  activity 


♦define  DEBUG  1 

♦define  b.majc  0  /*  maximum  number  of  whole  integer  delays  ♦/ 

♦define  maixbat  16  /*  maximum  number  of  data  batches  allowable  */ 

♦define  endbat  47  /*  last  batch  for  which  there  is  data  */ 

♦include  <stdio.h> 

♦include  <math.h> 

♦include  "dspstruct .h" 

♦include  "macros. h" 

♦include  "scrn.h" 

int  taberr.short 0 ; 

static  void  magnitudesO  ; 

struct  tabtrans  tramsferDC]  ,transferB[]  ; 

/♦  Help  subroutine  for  this  file  */ 
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void  tt_help() 

{ 

printf ("FAULT  TOLERANCE  APPLICATION  SEQUENCER  (tabtrans .c)\n") ; 
printfC"  runbeamO  -  run  all  trans.  once\n") ; 
printf ("  runbeaua(nrun)  -  run  all  trans.  nrun  times\n"); 
printfC"  runbeam(start,stop)  -  run  trans.  start  through  stop\n") ; 
printfC"  readcomC)  -  disp.  and  interpret  all  comm.  buffers\n"); 
printfC"  readbeam(board,proc)  -  disp.  buffers  for  beam.c  proc.Xn"); 
printfC"  readbeam_2Lll()  -  disp.  buffers  for  all  beam.c  procs.Xn"); 
printfC"  readcsp (board)  -  disp.  buffers  for  given  csp.c  proc.Xn"); 
printfC"  readcsp_all()  -  disp.  buffers  for  all  csp.c  procs.Xn"); 
printfC"  readbeam_mle()  -  disp.  buffers  for  beam.mle.c  proc.Xn"); 
printfC"  weightsO  -  prints  weight  matrix  from  beam.mle  proc.Xn"); 
printf ("Xn"); 
return; 

} 


/♦  Define  struct  which  keeps  track  of  which  application  has  been 
loaded  into  the  DSP32C's 


♦/ 

int  Nappl  =  -1; 
struct  appl  apples []  =  { 
{"N=l,  C=0  ", 
transferD,  1, 

1. 

1,  3. 

0, 

2,  3. 

0, 

{0,0, 0,0,  0,0,  0,0, 
{0.0, 0,0,  0,0,  0,0, 
0. 

0. 

{0.0, 0,0}, 

{0,0. 0,0}}, 


/♦  tramsactions  table  and  length  */ 

/*  number  of  packets  transmitted  by  input  */ 
/*  input  (board, proc)  ♦/ 

/*  number  of  packets  input  by  beam_mle  */ 

/*  beam_mle  (board, proc)  */ 

/♦  number  of  beam  processors  */ 
0,0,  0,0},  /*  beam  boards  in  order  */ 

0,0,  0,0},  /*  beam  procs  in  order  */ 

/♦  number  of  csp  processors  */ 

/♦  number  of  packets  input  by  each  csp  */ 

/*  csp  boards  in  order  */ 

/♦  csp  procs  in  order  */ 


{"N=10,  C=3  (weights  0, 
tramsferB,  17, 

4, 

1.  3, 

13, 

2.  3. 

10. 

{0.0. 0,0,  1,1,  2.2. 
{0,1, 2, 3.  1.2.  1.2. 


+-!)", 

/*  transactions  table  and  length  ♦/ 

/♦  number  of  packets  transmitted  by  input  */ 
/*  input  (board, proc)  list  */ 

/*  number  of  packets  input  by  beam_mle  */ 

/*  beam_mle  (board, proc)  list  */ 

/♦  number  of  beam  processors  */ 
3,3,  -1,-1},  /*  beam  boards  in  order  ♦/ 

1,2,  -1,-1},  /*  beam  procs  in  order  */ 


83 


3,  /*  number  of  csp  processors  ♦/ 

1,  /*  number  of  packets  input  by  each  csp  ♦/ 

{1,2,3,-!},  /*  csp  boards  in  order  ♦/ 

{0,0,0,-!}},  /*  csp  procs  in  order  */ 

{NULL,  NULL,  0,  0} 

}; 

/*  Define  hardware  addresses  for  the  !6  DSP32C’s  PIO  reg.  banks  *! 
#define  PIOO.O  (struct  dspstruct  *) (boardO+OxOO) 

#define  PI00_!  (struct  dspstruct  ♦) (board0+0x40) 

#def ine  PI00_2  (struct  dspstruct  ♦) (board0+0x80) 

#def ine  PI00_3  (struct  dspstruct  ♦) (boardO+OxcO) 

#def ine  PI0!_0  (struct  dspstruct  *) (board!+0x00) 

#define  PI0!_!  (struct  dspstruct  *) (board!+0x40) 

•define  PI0!_2  (struct  dspstruct  *) (board! +0x80) 

•define  PI0!_3  (struct  dspstruct  *) (board!+0xc0) 

•define  PI02_0  (struct  dspstruct  *) (board2+0x00) 

•define  PI02_!  (struct  dspstruct  *) (board2+0x40) 

•define  PI02_2  (struct  dspstruct  ♦) (board2+0x80) 

•define  PI02_3  (struct  dspstruct  ♦) (board2+0xc0) 

•define  PI03_0  (struct  dspstruct  ♦) (board3+0x00) 

•define  PI03_!  (struct  dspstruct  ♦) (board3+0x40) 

•define  PI03_2  (struct  dspstruct  ♦) (board3+0x80) 

•define  PI03_3  (struct  dspstruct  ♦) (board3+0xc0) 

/♦  Define  table  of  data  packet  transfers  */ 

/*  struct  tabtrans  { 

unsigned  short  int  lOboard,  SWboardO.SWboard! ,SWboard2,SWboard3; 
struct  dspstruct  *pPI0trans; 

struct  dspstruct  ♦pPIOrcvO,  *pPI0rcv!,  *pPI0rcv2,  *pPI0rcv3; 
int  transboard; 
char  ^comment; 

};  ♦/ 

/*  Define  table  of  data  packet  treuxsfers  for  N=!,  C=0  application  ♦/ 
static  struct  tabtrans  transferD[]  =  { 

{  lO.BrdO, 

SW_In0,  SW_0ut3  I  SW.OutEn  I  SH.LED,  NULL,  NULL, 

PI0!_3,  PI00_0,  NULL,  NULL,  NULL,  !, 

"Input  data  from  DSP[!]  [3]  to  DSP[0][0]"} 
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>; 


/*  This  is  the  table  defining  the  transfers  for  N=10,  C=3  ♦/ 
static  struct  tabtrans  transfers []  =  •( 

{  ID.BrdO  ft  lO.Brdl  ft  I0_Brd2  ft  lO.BrdS, 

SW.InO,  SW.InO  I  SW_0ut3  I  SW.OutEn  I  SW.LED,  SW.InO,  SW.InO, 

PI01.3,  PIOO.O,  PlOl.O,  PI02_0,  PI03_0,  1, 

"Input  data  from  DSP[1][3]  to  DSP[03[0],  DSP[1][0],  DSP[2][0],  DSP[3][0]"}, 
{  lO.BrdO  ft  lO.Brdl  ft  I0_Brd2  ft  I0_Brd3, 

SW.Inl,  SW.Inl  I  SW_0ut3  1  SW.OutEn  I  SW.LED,  SW.Inl,  SW.Inl, 

PIQ1.3,  PIQO.l,  PlOl.l,  PI02_1,  PI03_1,  1, 

"Input  data  from  DSP[l][3]  to  DSP[0][l],  DSP[l]  [l]  ,  DSP[2][l],  DSP[3][l]"}, 
{  lO.BrdO  ft  lO.Brdl  ft  I0_Brd2  ft  I0.Brd3, 

SW_In2,  SW_In2  I  SW_0ut3  I  SW.OutEn  I  SW.LED,  SW.In2,  SW.In2, 

PI01.3,  PI00.2,  PI01.2,  PI02.2,  PI03.2,  1, 

"Input  data  from  DSP[1]  [3]  to  DSP[0]  [2]  ,  DSP[l]  [2]  ,  DSP [2]  [2]  ,  DSP[3][2]"}, 
{  lO.BrdO, 

SW.In3,  SW_0ut3  I  SW.OutEn  I  SW.LED,  NULL,  NULL, 

PI01.3,  PI00.3,  NULL,  NULL,  NULL,  1, 

"Input  data  from  DSP[l][3]  to  DSP[0][3]"}, 

{  I0.Brd2, 

SW.OutO  I  SW.OutEn  |  SW.LED,  NULL,  SW_In3,  NULL, 

PIOO.O,  PI02.3,  NULL,  NULL,  NULL,  0, 

"Output  data  from  DSP[0][0]  to  DSP [2]  [3]"}, 

{  I0_Brd2, 

SW.Outl  I  SW.OutEn  |  SW.LED,  NULL,  SW_In3,  NULL, 

PIOO.l,  PI02_3,  NULL,  NULL,  NULL,  0, 

"Output  data  from  DSP[0][1]  to  DSP[2][3]"}, 

{  I0_Brd2, 

SW_0ut2  1  SW.OutEn  |  SW.LED,  NULL,  SW_In3,  NULL, 

PI00_2,  PI02.3,  NULL,  NULL,  NULL,  0, 

"Output  data  from  DSP[0][2]  to  DSP  [2]  [3]"}, 

{  I0_Brd2, 

SW_0ut3  1  SW.OutEn  |  SW.LED,  NULL,  SW_In3,  NULL, 

PI00_3,  PI02.3,  NULL,  NULL,  NULL,  0, 

"Output  data  from  DSP[0][3]  to  DSP  [2]  [3]"}, 

i  I0_Brd2, 

NULL,  SW.Outl  I  SW.OutEn  I  SW.LED,  SW.In3,  NULL, 

PlOl.l,  PI02.3,  NULL,  NULL,  NULL,  1, 

"Output  data  from  DSP[l][l]  to  DSP[2]C3]"}, 

{  I0.Brd2, 
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NULL,  SW_0ut2  I  SW.OutEn  I  SW.LED,  SH_In3.  NULL, 

PI01_2,  PI02.3,  NULL,  NULL,  NULL,  1, 

"Output  data  from  DSP[1] [2]  to  DSP [2] [3]"}, 

{  I0_Brd2, 

NULL,  NULL,  SW.Outl  I  SW_In3  I  SW.OutEn  I  SW.LED,  NULL, 
PI02_1,  PI02.3,  NULL,  NULL,  NULL,  2, 

"Output  data  from  DSP[2][l]  to  DSP  [2]  [3]"}, 

-C  I0.Brd2, 

NULL,  NULL,  SW.0ut2  I  SW.In3  I  SW.OutEn  I  SW.LED,  NULL, 
PI02_2,  PI02.3,  NULL,  NULL,  NULL,  2, 

"Output  data  from  DSP  [2]  [2]  to  DSP  [2]  [3]  "}■ , 

{  I0.Brd2, 

NULL,  NULL,  SW.In3,  SW.Outl  I  SW.OutEn  1  SW.LED, 

PI03_1,  PI02_3,  NULL,  NULL,  NULL,  3, 

"Output  data  from  DSP[3]  [1]  to  DSP[2][3]"}, 

■C  I0.Brd2, 

NULL,  NULL,  SW.In3,  SW.0ut2  I  SW.OutEn  I  SW.LED, 

PI03.2.  PI02.3,  NULL,  NULL,  NULL,  3, 

"Output  data  from  DSP [3] [2]  to  DSP [2] [3]"}, 

{  I0.Brd2, 

NULL,  SW.OutO  I  SW.OutEn  I  SW.LED,  SW.In3,  NULL, 

PlOl.O,  PI02.3,  NULL,  NULL,  NULL,  1, 

"Checksum  Output  from  DSP[1]C0]  to  DSP[2]  [3] "}, 
i  I0_Brd2, 

NULL,  NULL,  SW.OutO  I  SW_In3  I  SW.OutEn  I  SW.LED,  NULL, 
PI02_0,  PI02_3,  NULL,  NULL,  NULL,  2, 

"Checksum  Output  from  DSP[2][0]  to  DSP [2]  [3]"}, 

{  I0_Brd2, 

NULL,  NULL,  SW.In3,  SW.OutO  I  SW.OutEn  I  SW.LED, 

PI03_0,  PI02_3,  NULL,  NULL,  NULL,  3, 

"Checksum  Output  from  DSP[3][0]  to  DSP [2] [3]"} 

>; 

/*  This  is  the  routine  which  sequences  all  activity  */ 

/*  Basic  communication  protocol: 


Runbeam  writes  commauid  into  location  COMBUFO  in  DSP32C,  together 
with  any  auxiliary  data  in  locations  COMBUFl-7.  Upper  8  bits  is 
the  command,  lower  8  bits  is  a  unique  transaction  number. 


DSP32C  waits  for  a  new  command  to  appear  in  memory.  It  over¬ 
writes  the  command  with  -1  to  indicate  that  it  read  it,  and 
executes  the  command.  When  ready,  it  copies  the  command  into  PIR 
to  acknowledge  the  action.  It  then  returns  to  the  beginning  to 
await  a  new  commemd. 
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*/ 

/*  Global  error  injection  variables  ♦/ 
float  dataval[68] ;  /♦  Uncorrupted  data  ♦/ 

float  errval[2];  /♦  Injected  error  values  (constant)  ♦/ 
int  errtrans;  /*  Inject  error  just  before  this 

transaction  (if  any)  */ 

int  errboard;  /*  board  number  to  be  affected  ♦/ 

int  errproc;  /*  proc  number  to  be  affected  */ 

short  int  errflag;  /♦  indicates  error  type  */ 
int  ncorrect[4];  /♦  number  of  correct/incorrect  diagnoses 

above/below  threshold.  [0]=correct  above; 
[l]=correct  below;  [2]=incorrect  above; 

[3] “incorrect  below  ♦/ 

int  histogram[15] ;  /*  histogram  of  error  decisions  ♦/ 

/*  For  use  with  writing/ reading  actual  sonar  data  —  MATLAB  format  */ 
char  varout []  =  "BOO" ; 

char  fileinC]  =  "/usr/usr/ftsp/wbbl545/beam/input/Xxx.mat"; 
char  fileoutC]  =  "/usr/usr/ftsp/wbbl545/beam/batch/batch00 .mat" ; 
char  mskfileC]  =  "/usr/usr/ftsp/wbbl545/beam/input/mask.mat" ; 

void  runbeam(start,stop) 
int  start, stop; 

{ 

int  i,j ,ti,il,i2,  board,  proc; 

int  nrun , nr , f lag , dat af lag , mskf lag, rdf lag , Idf lag ; 

short  int  err,  k; 

unsigned  short  int  *I0CRptr; 

unsigned  short  int  *Sptr; 

struct  tabtrans  *thistable; 

register  struct  tabtrans  ♦ptrTab; 

register  int  safety, commeind; 

register  struct  dspstruct  *pPI0xmit; 

register  struct  dspstruct  *pPI0rcv0,*pPI0rcvl ,*pPI0rcv2,*pPI0rcv3; 
void  rd() ,wd() ; 

FILE  *mfp,  *fopen() ;  /♦  Masking  data  for  fault  option  2  */ 

long  dummy;  /*  Natlab  ♦/ 

char  *buf;  /*  file  header  */ 

double  temp;  /*  parameters  */ 

Sptr  =  (unsigned  short  int  ♦)IOSTATREC; 
lOCRptr  =  (unsigned  short  int  *)I0C0NREG; 

/*  Initialize  error  statistics  ♦/ 
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errtrans  =  -1; 
errboard  =  -1; 
errproc  =  -1; 
errflag  *  0; 
rdf lag  =  0; 
mskflag  «  0; 

/*  Figure  out  which  application,  has  been  loaded  */ 

f  indapplO ; 

if(Nappl  <  0)  return; 

printf ("Running  application  Xs\n", apples [Nappl] .comment) ; 
thistable  =  apples [Nappl] .table; 

/*  Parse  arguments  ♦/ 
if (stop  ==  0)  { 
if (start  ==  0)  { 
nrun  =  1; 

)•  else  { 

nrun  =  start; 

} 

start  =  0; 

stop  =  apples [Nappl] .ntrans; 

}  else  •[ 
nrun  =  1; 


/*  Main  host  data  packet  transfer  loop  ♦/ 

/♦  Setup  PCR  registers  - 

Run,  regular  map,  DMA,  16 'bit,  non-autoincrement  */ 
for(i®0  ;  i  <  NBOARD  ;  i++)  •( 
for(j=0  ;  j<4  ;  j++)  { 
il  =  ptrDSP[i][j]  ->  PCR; 

ptrDSP[i][j]  ->  PCR  «  ((il  ft  PCR.RUN)  I  PCR.REG  |  PCR.DMA  |  PCR_PI016) ; 

> 

} 

readPIOO ; 
if(askynq()  ==  -1) 
return; 

/♦  select  level  of  status  printout  during  run  */ 
flag  *  1; 

printf ("Choose  level  of  printout  during  run:\n"); 

printf ("  0  -  print  avg.  magnitudes  for  each  run  without  pause\n"); 
printf ("  1  -  print  avg.  magnitudes,  stop  if  error  above  thresh. \n"); 
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printfC"  2  -  print  avg.  magnitudes  for  each  run  with  pause\n") ; 
printfC"  3  -  print  avg  magnitudes  and  trans.  details\n") ; 
printfC"  4  -  print  avg.  magnitudes,  trans.  details,  PIO  reg.\n"); 
askint ("Choice:  ",ftflag); 

dataflag  *  askpromptC'Use  actual  sonar  data?  "); 
if  (dataflag  <  0)  return; 
if  (dataflag  >  0)  { 

Idflag  =  askprompt ("Load  data  into  INPUT  DSP?  "); 
if  (Idflag  >  0)  { 

askint ("  starting  from  batch:  ",Ai); 

il  »  (int) (i-l+maxbat)>(int)endbat  ?  (int) (endbat-i+1) : (int)maxbat ; 
nrun  =  (int)il; 
for(j=0;  j  <  il;  j++) 
wd(j ,i) ; 

}•  else  if  (Idflag  ==  0)  nrun  =  (int)maxbat; 
else  return; 

printf ("Select  output  datafile  save  option:\n"); 
printfC"  0  -  None\n"); 

printfC"  1  -  Save  uncorrected  (corrupted)  output\n") ; 
printfC"  2  -  Save  corrected  output\n"); 
askint ("Choice:  " .ftrdflag) ; 

} 

/♦  Initialize  error  statistics  */ 
for(i=0  ;  i<4  ;  i++)  { 
ncorrectCi]  =  0; 

} 

for(i=0  ;  i<®apples[Nappl] .nbeam  +  apples [Nappl] .ncsp  ;  i++)  { 
histogramCi]  =  0; 

> 

/♦  Determine  whether  to  screw  up  any  output  buffers  in  the  DSP32C’s  ♦/ 
il  *  askprompt ("Force  error  in  BEAM  or  CHECKSUM  processor?  "); 
if(il  <  0)  return; 
ifCil  >  0)  -C 
errboard  *  0; 
errproc  “  0; 

if (askint ("Channel  proc.  (1)  or  checksum  proc.  (2):  ",ftil))  { 
switch(il)  { 
case  1 : 

printf ("There  are  channel  procs.  1  to  Xd\n" .apples [Nappl] .nbeam) ; 
askintC'Hhich  cheinnel  proc.?  ",Ail); 
errboard  =  apples [Nappl] .beam_bd[ — il] ; 
errproc  =  apples [Nappl] .beam_pr[il] ; 
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break; 


} 


case  2: 

printf ("There  are  checksum  procs.  1  to  5Cd\n", apples [Nappl] .ncsp) ; 
askint ("Which  checksum  proc.? 

errboard  =  apples [Nappl] . csp.bd [ — il] ; 
errproc  *  apples [Nappl] .csp_pr[il] ; 
break; 

} 

} 

if (askint ("Constant  Error  (1)  or  Masking  Error  (2):  ".kmskflag))  { 
errtrans  =  apples [Nappl] .input .pack; 
if  (mskflag==l)  { 

printf ("Options :\n") ; 
printf ("  1  -  Add  constant  to  data\n") ; 

printf ("  2  -  Set  data  to  constant\n") ; 

printf  ("  3  -  Add  constant  to  single  saunple\n"); 

errvalCO]  =  0.0;  errval[l]  =  0.0; 
if (askint ("Choose:  ",ftil))  { 
errflag  =  il; 
if  (il  <  3)  { 

askfloatC  Real  value?  ",Aerrval[0])  ; 
askfloat("  Imaginary  value?  ”,Aerrval[l]) ; 

}  else  { 

askint("  Which  sample?  ",iil); 
askfloat("  Real  value?  ",fterrval[0]) ; 
askfloatC  Imaginary  value?  " ,fterrval [1]  )  ; 

} 

} 

} 

else  if  (mskflag==2)  { 

mfp  =  fopen(mskfile,"rb") ; 

/♦  Prepare  matlab  data  file  ♦/ 


f read (Adummy,sizeof (long) .1, mfp) ;  /♦  eat  machine  ID  ♦/ 

f read (tdummy.sizeof (long) ,1, mfp) ;  /♦  eat  #  rows  */ 

f read (4dummy,sizeof (long), 1, mfp);  /♦  eat  #  cols  ♦/ 

fread(ftdummy,sizeof (long) ,1, mfp);  /*  eat  imag.  data  flag  ♦/ 

f read (4dufflmy,sizeof (long) , 1, mfp);  /♦  var  name  length  ♦/ 

buf  =  (char  ♦) calloc (dummy, sizeof (char)) ; 

fread(buf  ,sizeof( char)  , dummy, mfp)  ;  /*  eat  var  naune  */ 

/*  Now  data  points  may  be  read  in  ♦/ 

} 


} 
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i  «  apples [Nappl] .nbeam; 

for  (j  “  0;  j  <  i+apples [Nappl] .ncsp;  j++)  { 
if  (j  <  i)  { 

board  *  apples  [Nappl]  .beain_bd[j]  ; 
proc  =  apples  [Nappl]  .beain_pr[j]  ; 

]•  else  •( 

board  =  apples  [Nappl]  .csp_bd[j-i]  ; 
proc  »  apples [Nappl] .csp_pr[j“i] ; 

> 

k  ®  0; 

if  ((errboard  ==  board)  (errproc  ==  proc)  tt  (mskflag  *=  1)) 
k  =  1; 

wr.s(board,proc,C0MBUF6,l,kk);  /*  Faulty  Proc.  (con.'^t.  fault)  ♦/ 
k  =  (short  int) (errf lag) ; 

wr_s(board,proc,C0MBUF4,l,kk) ;  /*  Fault  Type  (const,  fault)  */ 
k  *  (short  int)(il); 

wr_s(board,proc,C0MBUF7, l,ftk) ;  /*  Faulty  Samp,  (if  applicable)  */ 
wr_fdsp(boaxd,proc,C0MBUF0f ,l,fterrvaj.[0]) ;  /♦  Real  fault  val.  */ 
wr_fdsp(board,proc,COMBUFlf ,l,fterrval[l]) ;  /♦  Imag  fault  val.  */ 

y 


/* 

Write  Ns,  b.max  into  COMBUF  1,5.  Code  GO.N  into  COMBUFO 
Wait  to  make  sure  that  each  pro.  returns  a  GO.N  acknowledgement 

♦/ 


if (start  ==  0)  { 

for(i=0  ;  KNBOARD  ;  i++)  { 
for(j=0  ;  j<4  ;  j++)  { 
pPIOxmit  =  ptrDSP[i]  [j] ; 

pPIOxmit  ->  PIR  =0;  /♦  Init  PIR  reg.  to  default  value  ♦/ 

pPIOxmit  ->  PARE  *  COMBUFh; 

pPIOxmit  ->  PAR  =  COMBUFl; 

pPIOxmit  ->  PDR  =  (short  int) (Ns) ; 

pPIOxmit  ->  PAR  =  C0MBUF5; 

pPIOxmit  ->  PDR  «  (short  int)(b.max); 

pPIOxmit  ->  PAR  »  COMBUFO; 

pPIOxmit  ->  PDR  ■  GO.N; 

safety  ®  0; 

while (pPIOxmit->PIR  !«  GO.N)  { 
if(safety++  >  SAFENUM)  { 
safety  «  0; 

/♦  readPIOO ;  ♦/ 

printf ("Waiting  for  (%d,Xd)  to  acknowledge  new  Ns  value. \n", 

i.j): 
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printfC"  Current  PIR=Xx.  Expecting  Xx.  Wait  ", 
pPIOxmit->PIR,GQ_N) ; 
il  =  askynqO ; 
if(il  ==  0)  { 
break; 

}  else  if(il  -1)  { 
return; 

} 

} 

} 

if (flag  >=  3)  { 

printf ("Acknowledge  code  (PIR) :  =  Xx  (#tries=Xd)\n" , 
pPIOxmit->PIR, safety) ; 

} 

} 

} 

} 


dispinitOO ; 

/*  LOOP  THROUGH  PROGRAM  NRUN  TIMES  */ 
for (nr  =  1  ;  nr  <=  nrun  ;  nr++)  { 
printf ("♦♦♦  BATCH  Xd  ***\n",nr); 

for(i  =  start  ;  i  <  stop  ;  i++)  {  /♦  DO  ALL  TRANSACTIONS  ♦/ 

ptrTab  =  &thistable[i] ; 

if (flag  >=  3)  { 

printf ("\nTransaction:  Xd\n"  ,i  ); 
printfC  Xs\n"  .ptrTab  ->  comment  ); 

} 

/♦  Inject  a  random  error,  if  desired,  before  this  transaction  */ 
if  ((i==errtrans)  tt  (mskflag==2))  i 
printf ("INJECTING  ERROR  INTO  PROCESSOR  Xd.Xd", 
errboard.errproc) ; 
for  (il“0;  il  <  2*Ns;  il++)  { 

f read (it emp.sizeof (double) ,l,mfp) ; 
datavalCil]  =  (float)temp; 

} 

wr_f  dsp (errboard ,  errproc , BBUF ,  2’*‘Ns ,  dat aval)  ; 

} 

/*  Setup  the  10  board  ♦/ 

♦lOCRptr  -  lOCR  =  ptrTab  ->  lOboard; 
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/♦  Setup  the  FIFO  switch  registers  on  each  DSP32C  board  */ 
ptrODSPO  ->  PARE  =  DSPSWREGh; 
ptrODSPO  ->  PAR  *  DSPSWREGl; 

ptrODSPO  ->  PDR  =  DSPCR[0]  =  ptrTab  ->  SWboardO; 

ptrlDSPO  ->  PARE  =  DSPSWREGh; 
ptrlDSPO  ->  PAR  =  DSPSWREGl; 

ptrlDSPO  ->  PDR  «  DSPCRCl]  =  ptrTab  ->  SWboardl; 

ptr2DSP0  ->  PARE  =  DSPSWREGh; 
ptr2DSP0  ->  PAR  *  DSPSWREGl; 

ptr2DSP0  ->  PDR  =  DSPCR[2]  =  ptrTab  ->  SWboard2; 

ptr3DSP0  ->  PARE  =  DSPSWREGh; 
ptr3DSP0  ->  PAR  =  DSPSWREGl; 

ptr3DSP0  ->  PDR  =  DSPCR[3]  =  ptrTab  ->  SWboard3; 

/*  Send  code  to  start  the  transmission  */ 

pPIOxmit  =  ptrTab  ->  pPIOtrans; 
if (flag  >=  3)  { 

printf  ("prior  transmitter  PIR  =  */,x\n"  ,pPIOxmit->PIR) ; 

} 

pPIOxmit  ->  PARE  =  COMBUFh; 
pPIOxmit  ->  PAR  =  COMBUFO; 
command  =  G0_XMIT+i; 
pPIOxmit  ->  PDR  =  command; 

/♦  wait  until  acknowledge  code  from  DSP  in  PIR  register  ♦/ 

/*  (verify  that  DSP  is  has  begun  trzmsmission)  ♦/ 

safety  =  0; 

while (pPIOxmit  ->  PIR  !=  command)  { 
if(safety++  >  SAFENUM)  { 
readPIOO ; 
safety  *  0; 

for(il=0  ;  iKNBOARD  ;  il++)  { 
for(i2*0  ;  i2<4  ;  i2++)  { 

if (ptrDSP[il] [i2]  ==  pPIOxmit)  { 
printf  ("Transaction  Xd:  Waiting  for  transmitter  (*/.d,'/.d)  An", 
i,il,i2); 

break; 

} 

} 
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} 

printfC"  Currently  PIR“51x;  Expecting  Xx.  Wait  ", 
pPIOxmit->PIR,GO_XMIT+i) ; 
il  =  askynqO ; 
if(il  ==  0)  { 
break; 

}  else  if(il  ==  -1)  { 
return; 

} 

} 

} 

if (flag  >=  3)  { 

printf ("Transmitter:  PIR  =  */x  (#tries=*/.d)\n" , 
pPI0xmit->PIR, safety) ; 

} 

/♦  Send  GO.RCV  code  to  start  reception  ♦/ 
command  =  G0_RCV+i; 

if  ((pPIOrcvO  =  ptrTab  ->  pPIOrcvO)  !=  0) 

{ 

if (flag  >=  3)  { 

printf  ("Write  G0_RCV  in  rcvO  (code  }lx)\n",commamd) ; 

} 

pPIOrcvO  ->  PARE  =  COMBUFh; 
pPIOrcvO  ->  PAR  =  COMBUFO; 
pPIOrcvO  ->  PDR  =  command; 

if  ((pPIOrcvl  =  ptrTab  ->  pPIOrcvl)  !=  0) 

if (flag  >=  3  )  { 

printf  ("Write  GO.RCV  in  rcvl  (code  y,x)\n" ,  command)  ; 

} 

pPIOrcvl  ->  PARE  =  COMBUFh; 
pPIOrcvl  ->  PAR  =  COMBUFO; 
pPIOrcvl  ->  PDR  ®  command; 

if  ((pPI0rcv2  =  ptrTab  ->  pPI0rcv2)  !=  0) 

{ 

if (flag  >=  3)  { 

printf  ("Write  GO.RCV  in  rcv2  (code  */,x)\n"  .command)  ; 

} 

pPI0rcv2  ->  PARE  =  COMBUFh; 
pPI0rcv2  ->  PAR  =  COMBUFO; 
pPI0rcv2  ->  PDR  =  command; 
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if  ((pPI0rcv3  =  ptrTab  ->  pPI0rcv3)  !®  0) 
if (flag  >=  3)  { 

printf  ("Write  G0_RCV  in  rcv3  (code  */,x)\n"  .command)  ; 

} 

pPI0rcv3  ->  PARE  =  COMBUFh; 
pPI0rcv3  ->  PAR  =  CQMBUFO; 
pPI0rcv3  ->  PDR  *  command; 

} 

> 

> 

} 

/♦  Wait  for  ack.  from  all  rcvrs,  starting  with  the  first  */ 
if  (pPIOrcvO  !=  0) 

safety  =  0; 

while (pPIOrcvO  ->  PIR  !=  command)  •( 
if(safety++  >  SAFENUM)  { 
readPIOO; 
safety  ®  0; 

for(il=0  ;  iKNBOARD  ;  il++)  < 
for(i2=0  ;  i2<4  ;  i2++)  { 

if (ptrDSP[il] [i2]  ==  pPIOrcvO)  { 
printf  ("Transaction  %d:  Waiting  for  receiver  (Xd,*/id) .  \n" , 
i,il,i2) ; 
break; 

} 

} 

} 

printf ("  Current  PIR=7,x,  expecting  7,x.  Wait  ", 
pPIOrcvO->PIR, command) ; 
il  =  askynqO; 
if(il  ==  0)  { 
break; 

}  else  if(il  ==  -1)  { 
return; 

} 

} 

} 

if (flag  >=  3)  { 

printf  ("receiver  0:  PIR  =  7.x  (#tries=7.d)\n", 
pPIOrcvO->PIR, safety) ; 

} 
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if  (pPIOrcvl  !*  0) 

{ 

safety  *  0; 

while  (pPIOrcvl  ->  PIR  !=  commeaid){ 
if(safety++  >  SAFENUM)  { 
readPIOO; 
safety  =  0; 

for(il=0  ;  iKNBOARD  ;  il++)  { 
for(i2=0  ;  i2<4  ;  i2++)  { 

if(ptrDSP[il][i2]  »=  pPIOrcvl)  { 

printf ("Transaction  %d:  Waiting  for  receiver  (Xd,Xd).\n", 
i,il,i2); 

break; 

} 

} 

} 

printf ("  Current  PIR=%x,  expecting  y,x.  Wait  ", 
pPIOrcvl->PIR, command) ; 
il  =  askynqO ; 
if(il  ==  0)  { 
break; 

}  else  if(il  **  -1)  •( 
return; 

} 

} 

} 

if (flag  >=  3)  { 

printf  ("receiver  1:  PIR  =  y,x  (#tries=y.d)\n" , 
pPIOrcvl->PIR, safety) ; 

} 

if  (pPI0rcv2  !=  0) 

{ 

safety  =  0; 

while (pPI0rcv2  ->  PIR  !=  command) { 
if(safety++  >  SAFENUM)  { 
readPIOO; 
safety  *  0; 

for(il=0  ;  il<NB0ARD  ;  il++)  { 
for(i2*0  ;  i2<4  ;  i2++)  { 

if(ptrDSP[il] Ci2]  «=  pPI0rcv2)  { 
printf ("Transaction  Xd:  Waiting  for  receiver  (Xd,Xd).\n", 
i,il,i2); 

break; 
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} 

} 

} 

printfC  Current  PIR*y,x,  expecting  Xx.  Wait  ", 
pPI0rcv2->PIR, command) ; 
il  =  askynqO; 
if(il  **  0)  { 
break; 

}  else  if(il  -1)  { 
return; 

} 

} 

} 

if (flag  >=  3)  { 

printf ("receiver  2:  PIR  *  Xx  (#tries=Xd)\n" , 
pPI0rcv2->PIR, safety) ; 

> 

if  (pPI0rcv3  !=  0) 

{ 

safety  =  0; 

while (pPI0rcv3  ->  PIR  !=  commamd){ 
if(safety++  >  SAFENUM)  { 
readPIOO; 
safety  =  0; 

for(il=0  ;  iKNBOARD  ;  il++)  { 
for(i2=0  ;  i2<4  ;  i2++)  { 

if (ptrDSP[il] [i2]  ==  pPI0rcv3)  { 

printf ("Transaction  Xd:  Waiting  for  receiver  (Xd,Xd).\n", 
i,il,i2); 

break; 

} 

} 

} 

printf ("  Current  PIR=Xx,  expecting  Xx.  Wait  ", 
pPI0rcv3->PIR,commauid) ; 
il  =  askynqO; 
if(il  =*  0)  { 
break; 

}  else  if(il  ==  -1)  ■( 
return ; 

} 

} 

} 

if (flag  >=  3)  { 
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printf ("receiver  3:  PIR  *  Xx  (•tries=Xd)\n", 
pPI0rcv3->PIR, safety) ; 

} 

} 

} 

} 


/♦  Turn  off  the  output  buffer  on  the  trzoismitter  ♦/ 

pPIOxmit  ->  PARE  =  DSPSWREGh; 
pPIOxmit  ->  PAR  »  DSPSWREGl; 

DSPCRCptrTab  ->  transboard]  =  pPIOxmit  ->  PDR  *  SW_LED; 

if (flag  >=  4)  { 
readPIOO ; 
il  =  askynqO: 
if(il  ==  0)  { 
break; 

}  else  if(il  ==  -1)  { 
return ; 

} 

} 

}  /*  end  transact  ♦/ 

/♦  print  results  of  the  GLRT  test  ♦/ 
if (stop  ==  apples [Nappl] .ntrans  )  { 

/*  Wait  until  beam_mle()  is  finished  ♦/ 
if((i  =  apples [Nappl] .mle_bd)  >=  0)  { 
j  =  apples [Nappl] .mle.pr; 
safety  =  0; 

while (ptrDSP[i][j]  ->  PIR  !=  G0.D0NE){ 
if(safety++  >  SAFENUM)  •( 
readPIOO ; 
safety  »  0; 

printf ("Waiting  for  beam.mle  (Xd,Xd)  to  f inish\n",i,j) 
printf ("  Current  PIR=Xx,  expecting  Xx.  Wait  ", 
ptrDSP[i] [j]->PIR,G0_D0NE) ; 
il  =  askynqO; 
if(il  *«  0)  { 
break; 

}  else  if(il  »»  -1)  { 
return ; 

} 
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} 

} 

if (flag  >=  3)  { 

printf  ("Beam.mleO  is  done:  PIR  -  V,x  (#tries=5(d)\n", 
ptrDSP[i] [j] ->PIR, safety) ; 

} 

i  =  taberr_short(fterr) ; 
if(i  >=  0)  { 
ncorrect [i]++; 
histogram [err] ++ ; 

} 

if (flag  =»  1  ftft  (i==0  II  i==2))  break; 

if  (flag  >=  2  Aft  nr  <  nrun  Aft  askynqO  <=  0)  break; 

} 

} 

if  (rdflag  >  0) 

rd(nr,rdflag) ;  /♦  read  output  data,  write  to  .MAT  data  file  ♦/ 
}  /♦  END  NRUN  ♦/ 
if  (mskflag  2) 
f close (mfp) ; 

if (stop  ==  apples [Nappl] .ntrans)  { 
printf ("***DIAGN0SES:  above  threshold  below  threshold\n") ; 
if(errtrans  >=  0)  i 
printf("  Correct:  */,15d  %15d\n", 

ncorrect [0] .ncorrect [l] ) ; 
printf("  Incorrect:  ’/.ISd  y,15d\n", 

ncorrect [2] .ncorrect [3] ) ; 

}■  else  { 

printf("  count:  '/.ISd  Xl5d\n", 

ncorrect [2] .ncorrect [3] ) ; 

} 

printf("\n***HIST0GRAM:\n  Hyp:  ”); 

j  «  apples [Nappl] .nbeam; 
il  =  j -•■apples  [Nappl]  .ncsp; 
for(i=0  ;  i<j  ;  i**)  { 
printf("  beam"); 

} 

for(;  i<il  ;  i-»"»-)  { 
printf("  csp"); 

> 

printf("\n  none"); 
for(i*0;  i<j  ;  i*+)  { 
printfC  y.d,y.d", 

apples [Nappl] ,beam_bd[i] .apples [Nappl] .beam_pr[i]) ; 

} 
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for(  ;  i<il  ;  i++)  { 
printfC"  Xd.Xd", 

apples [Nappl] .csp_bdCi-j3 , apples [Nappl] .csp_pr[i-j]) ; 

} 

priiitf("\n 

for(i«0  ;  i<*il  ;  i++)  •( 

printf ("X5d" .histogram [i] ) ; 

} 

printf("\n''); 

} 

return; 

} 

/*  Display  the  communications  buffer  area  ♦/ 
void  readcomO 

printf ("COMBUFO:  CommzmdXn") ; 
reads.alKCOMBUFO.  1) ; 
printf ("COMBUFl;  Ns\n"); 
reads.all (COMBUFl , 1) ; 

printf  ("C0MBUF2:  Last  Erroneous  CommaindXn") ; 
reads.all (C0MBUF2 , 1) ; 

if (!askyes())  return; 

printf ("C0MBUF3:  Number  of  data  batches  processedXn") ; 
reads.all (C0MBUF3 , 1) ; 

printf ("C0MBUF4:  INPUT,  BEAM.MLE:  #  separate  data  packets  processedXn") ; 
printfC"  BEAM,  CSP:  Fault  type\n"); 

reads.all (C0MBUF4 , 1) ; 

printf ("C0MBUF5:  BEAM,  CSP  :  Maximum  #  whole  integer  delays,  b\n") ; 
reads.all (C0MBUF5 , 1) ; 

if(!askyes())  return; 

printf ("C0MBUF6:  BEAM.MLE:  GLRT  Thresholded  decisionXn") ; 
printfC"  BEAM,  CSP:  Faulty  processor  \n"); 

reads.all (C0MBUF6 , 1) ; 

printf ("C0MBUF7:  BEAM.MLE:  Non-thresholded  decision\n") ; 
printfC"  BEAM,  CSP:  Faulty  sample  numberXn") ; 

reads.all (C0MBUF7,1) ; 

if (faskyesO)  return; 

printf ("COMBUFOf:  BEAM,  CSP:  Real  fault  valueXn") ; 
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readf dsp.all (COMBUFOf , 1) ; 

printf ("COMBUFlf :  BEAN,  CSP:  Imaginary  fault  value\n"); 
readf dsp.all (COMBUFlf , 1) ; 

printf ("MBUF:  Average  squared  magnitude  of  output \n"); 
readfdsp_all(MBUF, 1) ; 

return; 

> 


/♦  Display  all  the  data  buffers  for  BEAH.C  ♦/ 
void  readbeam (board, proc) 
int  board, proc; 

short  int  N; 

/♦  Read  value  of  Ns  from  beam.c  ♦/ 
rd_s(board,proc,COMBUFl,l,AN) ; 
debuglC  Ns=*/.d\n",N); 

/*  Display  the  output  buffer  */ 

printf  ("\nBEAM(%d,y,d)  -  output  beam  response  buffer\n'’,board,proc) ; 
readf dsp(board, proc, BBUF,N*2) ; 

return ; 

> 


/♦  Display  all  the  data  buffers  for  all  BEAM.C  */ 
void  readbeam.allO 
{ 

int  i; 

/*  Figure  out  which  application  has  been  loaded  */ 

f  indapplO ; 

if(Nappl  <  0)  return; 

printf ("Assuming  application  Xs  was  recently  run\n", 
apples [Nappl] . comment) ; 

/♦  Find  next  beaun  processor,  display  its  status  */ 
for(i*0  ;  i<apples [Nappl] .nbeam  ;  i++)  { 

readbeam (apples [Nappl] .beam_bd[i] , apples [Nappl] .beam_pr[i]) ; 

if (askpromptC'More  BEAM  buffers?")  <*  0)  return; 
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} 

return ; 

> 


/♦  Display  the  data  buffers  for  CSPx.C  */ 
void  readcsp (board, proc) 
int  board, proc; 

short  int  N; 

/♦  Read  value  of  H  from  csp.c  ♦/ 
rd_s (board, proc, COMBUFl,!, AN) ; 
debugK"  Ns=y.d\n",N); 

/♦  Display  the  output  buffer  ♦/ 

printf ("\nCSP(%d,yd)  -  output  checkstim  beam  response  buffer\n", 
board, proc) ; 

readfdsp (board, proc, BBUF,N*2) ; 
return ; 

> 


/♦  Display  the  data  buffers  for  all  CSPx.C  ♦/ 
void  readcsp.allO 

int  i; 

/♦  Figure  out  which  application  has  been  loaded  ♦/ 

f  indappK) ; 

if(Nappl  <  0)  return; 

printf  ("Assuming  application  Xs  was  recently  r\in\n", 
apples [Nappl] .comment) ; 

/*  Display  the  status  of  all  the  checksum  processors  ♦/ 
for(i»0  ;  i<apples [Nappl] .ncsp  ;  i++)  { 

readcsp(apples [Nappl] .csp_bd[i] , apples [Nappl] .csp_pr[i]) ; 
if (askpromptC'More  CSP  buffers?")  <*  0)  return; 

> 

return ; 

> 

/*  Display  the  data  buffers  for  the  BEAM.MLE.C  processor  ♦/ 
void  readbeam.mleO 
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{ 

short  int  N; 
short  err; 

int  i,il, board, proc; 

/*  Figure  out  which  application  has  been  loaded  */ 

findappK) ; 

if(Nappl  <  0)  return; 

printf ("Assuming  application  Xs  was  recently  run\n", 
apples [Nappl] .comment) ; 
board  =  apples [Nappl] .mle.bd; 
proc  =  apples [Nappl] .ml e_pr; 

/♦  Read  value  of  N  from  beam.mle.c  */ 
rd_s (board, proc, COMBUFl , 1 ,ftN) ; 
debugK"  Ns=y.d\n",N); 

/*  Diagnosed  failure  */ 

rd_s (board, proc, C0MBUF6, 1 jfterr) ; 

if (err  >  0)  { 

printf ("BEAM.MLE  -  failure  diagnosed  in  processor  */.d  ",err); 

i  =  apples [Nappl] .nbeam; 
if (err  <=  i)  { 

printf  ("(y,d,*/,d)  \n",  apples  [Nappl]  .beam_bd[err-l]  , 
apples [Nappl] .beam_pr[err-l] ) ; 

}  else 

printf  ("(yd,y,d) \n",  apples  [Nappl]  .csp_bd[err-i-l]  , 
apples [Nappl] .csp_pr[err-i-l] ) ; 


> 

}  else  { 

printf ("BEAM.MLE  -  no  failure  detected\n") ; 

> 

/*  Display  the  input  buffers  */ 

for(i®l;  i<=apples [Nappl] .nbeam  +  apples [Nappl] .ncsp  ;  i++)  { 
if(i  <*  apples [Nappl] .nbeam)  { 

printf ("\nBEAM_MLE  -  input  buffer,  BEAM  processor  yd\n",i); 

}  else  { 

printf ("\nBEAM_MLE  -  input  buffer,  CHECKSUM  processor  yd\n",i) 

} 

readf dsp (board , proc , IBUF+ ( i-1 ) ♦8*N , 2*N) ; 
il  »  askpromptC'More  input  buffers?"); 
if(!il)  { 
break; 
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}  else  if(il  »=  -1)  { 
return; 

> 

} 

/♦  Display  the  syndrome  buffers  ♦/ 
for(i=l;  i<=apples[Nappl] .ncsp  ;  i++)  { 

printf ("\nBEAM_MLE  -  syndrome  buffer  %d\n",i); 
readf dsp (board , proc , 0BUF+ (i-1 ) ♦8*N , 2*N) ; 
il  =  askpromptC'More  syndrome  buffers?”); 
if(!il)  { 
breeik; 

}  else  if(il  -1)  { 
return ; 

} 

} 

/♦  Display  the  syndrome  cross-correlations  ♦/ 
printf  (''\nBEAM_MLE  -  syndrome  cross-correlations\n")  ; 
i  s  apples [Nappl] .ncsp; 
i  =  i*i; 

readfdsp(board,proc,RBUF,i) ; 

/♦  Display  the  likelihoods  */ 
printf ("\nBEAM.MLE  -  Likelihoods\n”) ; 
i  *  apples [Nappl] . nbeam  +  apples [Nappl] . ncsp ; 
readfdsp(board,proc,LBUF, i) ; 
if (!askyes())  return; 

/♦  Display  the  computed  threshold  ♦/ 
printf ("\nBEAM_MLE  -  Threshold\n”) ; 
readf  dsp (board , proc , LBUF+ i *4,1); 
if (!askyes())  return; 

/♦  Display  the  fixed  data  */ 
if  (err  >  0)  •[ 

printf ("\nBEAM_MLE  -  processor  %d  fixed  data\n",err) ; 
readf dsp (board, proc, CBUF,2*N) ; 

} 

return; 

} 

/*  Returns  code  of  which  processor  failed  in  ♦error 

Returns  0  if  correct  diagnosis  above  threshold,  1  if  correct 
diagnosis  below  threshold,  2  if  incorrect  above  threshold,  3 
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if  incorrect  below  threshold,  -1  if  *q’  typed  during  printout 
or  if  error. 

♦/ 

static  int  taberr. short (error) 
short  *error; 

{ 

short  int  N; 
short  err; 
int  i,il,errb,errp; 
int  board, proc; 
int  retval; 

/♦  Find  board,  proc  location  of  outmle.c  */ 
board  =  apples [Nappl] .mle.bd; 
proc  =  apples [Nappl] .mle.pr; 

/♦  Read  value  of  N  from  outmle.c  */ 
rd_s(board,proc,COMBUFl , 1 ,ftN) ; 
debugK"  Ns=y.d\n".N); 

/♦  Display  the  magnitudes  */ 
magnitudes () ; 

/*  Diagnosed  failure  */ 

rd_s (board, proc, C0MBUF6,l,4err) ; 

if (err  >  0)  { 

♦error  =  err; 
i  =  apples [Nappl] .nbeam; 
if (err  <=  i)  { 

errb  =  apples [Nappl] .beam.bd [err- 1] ; 
errp  =  apples [Nappl] .beam.pr [err- 1] ; 

}  else  { 

errb  »  apples [Nappl] .csp_bd[err-i-l] ; 
errp  *  apples [Nappl] .csp_pr[err-i-l]  ; 

> 

/♦  Correct  failure  diagnosis  ♦/ 

if(errtrans  >=0  44  errb  *=  errboard  44  errp  ==  errproc)  { 
printf( "CORRECT  DIAGNOSIS:  processor  (Xd,Xd)  failed\n", 
errb, errp) ; 
retval  =  0; 

/♦  Failure  occurred,  but  incorrectly  diagnosed  ♦/ 

}  else  if(errtrans  >*0)  { 

printfC INCORRECT  DIAGNOSIS:  processor  (y.d,y.d)  actually  f  ailed\n"  , 
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errboard.errproc) ; 

printfC  processor  (Xd.Xd)  declared  as  f ailed", errb, errp)  ; 
retval  =  2; 

/♦  No  failure,  but  GLRT  declared  a  failure  anyway  ♦/ 

}  else  { 

printf ("FALSE  ALARM: \n"); 

printfC  no  failure,  but  processor  (y.d,Xd)  declared  faulty\n", 
errb, errp) ; 
retval  =  2; 

} 

}•  else  •( 

♦error  =  apples [Nappl] .nbeam  +  apples [Nappl] .ncsp  +  1; 
if(errtrans  >®  0)  {  /♦  Failure,  missed  by  GLRT  ♦/ 

printf ("FAILURE  NOT  DIAGNOSED : \n" ) ; 
printfC  processor  failure  in  (Zd,y,d)  missed\n", 
errboard,errproc) ; 
retval  *  3; 

}  else  { 

printf ("CORRECT  DIAGNOSIS:  ALL  PROCESSORS  W0RKING\n"); 
retval  »  1; 

} 

} 

return (retval) ; 


/♦  This  subroutine  uses  the  information  in  the  apples  descriptor 
to  figure  out  where  all  the  beam  and  csp  programs  are  supposed 
to  be,  reads  the  complex  outputs  from  outmle,  computes  the  average 
magnitudes,  eind  then  prints  a  listing  of  the  magnitudes  in  a 
4  by  4  square,  matching  th"  magnitude  to  the  processor  location. 

*/ 

static  void  magnitudes () 

{ 

int  board, proc, i, j ,nf,  mleb,  mlep; 

float  f aulty_mag[4]  [4]  ,correct_mag[4]  [4]  ,mauc  =  -1.0; 

/♦  Init  the  Mag  array  ♦/ 

for (board  =  0  ;  board  <  NBOARD  ;  board++)  { 
for (proc  ■  0  ;  proc  <  4  ;  proc++)  { 
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faulty.mag [board] [proc]  =  -1.0; 
correct_mag[boaurd]  [proc]  =  -1.0; 

} 

} 

mleb  =  apples [Nappl] .mle_bd; 
mlep  =  apples [Nappl] .mle.pr; 

/♦  Read  magnitudes,  allocate  to  the  positions  in  the  array  ♦/ 

for(i=  0  ;  i<apples [Nappl] .nbeam;  i++)  { 
board  =  apples [Nappl] .beam_bd[i] ; 
proc  =  apples [Nappl] .beam_pr[i] ; 

rd_fdsp(board,proc,MBUF,l,ftfaulty_mag[board] [proc]) ; 
if  ((board  !=  errboard)  I  I  (proc  !=  errproc)) 

correct.mag [board] [proc]  =  faulty.mag [board] [proc] ; 
else 

rd_fdsp(mleb,mlep,MBUF,l ,4correct_mag [board] [proc]) ; 
if  (max-faulty_mag[bo2Lrd]  [proc]<0)  max=faulty_mag  [board]  [proc]  ; 

} 

for(i=0  ;  i<apples [Nappl] .ncsp;  i++)  {  /*  used  only  for  scaling  */ 

board  =  apples [Nappl] .csp.bd[i] ; 
proc  =  apples [Nappl] .csp_pr[i] ; 

rd_fdsp(board,proc,MBUF,l,ftfaulty_mag[boaurd] [proc]) ; 
if  (max-faulty_mag [board] [proc]<0)  max=faulty_mag [board] [proc] ; 
faulty.mag [board] [proc]  =  -1.0; 

} 

dispO (faulty_mag, correct _mag ,mauc)  ; 

/♦  Print  the  magnitudes  ♦/ 
return; 

} 

dispinitOO 

i 

int  i,j,rl,r2; 
float  tht; 

home_cursor() ; 
clear_screen() ; 
move_cursor(l,30) ; 

printf ("Average  squared  magnitudes"); 
move.cursor (2 , 18) ; 
printf ("uncorrected") ; 
move_cursor(2,45) ; 
printf ("1"); 
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inove_cursor(2,57) ; 
printf ("corrected") ; 
set_scroll(16,24) ; 
inverse.videoO ; 
for  (i=l;  i<=  10;  i++)  { 
inove_cursor(i+2,3) ; 
tht  =  -22.5  +  (i-l)*5.0; 
if  (tht  <  0.0)  printf("X5.1f",tht); 
else  printf ("+%4. If ", tht) ; 

> 

regular_video() ; 
inove_cursor(16,0) ; 

> 

dispO  (Magi  ,MeLg2  .max) 

float  Magi [4] [4] ,Mag2[4] [4] .max; 

int  board. proc.i.  barend,  line; 
float  ratio. samp; 

ratio  =  33.0/max; 
for  (i  »  3;  i  <  13;  i++)  { 
move_cursor(i.lO) ; 
erase.lineO  ; 
move_cursor(i .45) ; 
printf ("I"); 

} 

for  (i  =  1;  i  <=  2;  i++)  { 

for(board  =  0  ;  board  <NB0ARD  ;  board++)  { 
for(proc=0  ;  proc<4  ;  proc++)  ■( 

if  (i==l)  samp  =  Magi [board] [proc] ;  /*  faulty  output  ♦/ 
else  samp  =  Mag2 [board] [proc] ;  /♦  corrected  output  */ 

if(seunp  >*  0)  { 

barend= ( int ) ( samp*rat io)+10+(i-l) *37 ; 
if  ((barend  >  43)  (i  =®  1)) 
barend  =  43; 

else  if  ((barend  >  80)  44  (i  ==  2)) 
barend  •=  80; 
if  (board==0){ 
line  *  proc+3; 

} 

else  if  (board*® 1  44  0<proc  44  proc<3){ 
line  ®  proc+6; 

} 

else  if  (board==2  44  0<proc  44  proc<3)-C 
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line  =  proc+8; 

} 

else  if  (board==3  ftft  0<proc  tk  proc<3){ 
line  =  proc+10; 

} 

else  break; 

barchC  line  ,  10+((i-l)*37)  ,  barend  ); 
if  (samp  ==  maz){ 
inverse.videoO ; 
printf ("X") ; 
regular_video() ; 

} 

} 

} 

} 

} 

move_cursor(16,0) ; 

}■  /♦  end  dispO  ♦/ 

/♦  This  subroutine  tries  to  determine  the  application  loaded. 

If  Nappl  is  -1,  then  asks  user  for  help,  after  displaying  which 
DSP32C  programs  appear  to  have  been  loaded. 

If  Nappl  >=  0,  then  simply  returns 

*/ 

void  findapplO 

{ 

int  i; 


if (Nappl  ==  -1)  { 

printf ("Cainnot  tell  which  application  has  been  loaded. \n"); 
layout  0 ; 

printf ("Which  application  is  this?  \n"); 
for(i=0  ;  apples [i] .comment  ;  i++)  { 

printf  ("  Xd:  y.s\n", i, apples [i]  .comment)  ; 

> 

askint ("Choose?  " ,ANappl) ; 

} 

return ; 

} 


/♦  This  subroutine  reads  the  Wmat  matrix  from  outmleO ,  and 
formats  this  weight  matrix  for  display 

*/ 
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void  weights () 

{ 

int  i, j ,k, board, proc; 
int  nf,nc; 
short  sdata[9]: 
float  Wmat [64] ; 
float  W[4][16]; 

/♦  Find  the  application  */ 

f  indapplO ; 

if(Nappl  <  0)  return; 

/*  Count  number  of  FFT  and  CSP  processors  ♦/ 
nf  =  apples [Nappl] .nbeam; 
nc  *  apples [Nappl] .ncsp; 

/*  Read  the  weight  matrix  ♦/ 

rd.fdsp (apples [Nappl] ,mle_bd, apples [Nappl] .mle_pr,WBUF,nc*(nf+nc) , 
Wmat) : 

/*  Distribute  the  weights  among  the  boards  */ 
for(i=0  ;  i<4;  i++)  { 
for(j*0  ;  j<16  ;  j++)  { 

W[i]  [j]  =  0. ; 

} 

} 

for(i=0  ;  i<nf+nc  ;  i++)  { 
if(i  <  nf)  { 

board  *  apples [Nappl] .beam_bd[i] ; 
proc  =  apples [Nappl] .beam_pr[i] ; 

}  else  { 

board  =  apples [Nappl] .csp_bd[i-nf] ; 
proc  =  apples [Nappl] .csp_pr[i-nf] ; 

} 

for(k=0  ;  k<nc  ;  k++)  { 

W [board] [proc+4*k]  *  Wmat [i+(nf+nc)*k] ; 

} 

} 

/♦  Format  for  display  */ 
for(i®0  ;  i<NB0ARD  ;  i++)  { 
printf ("(Xd,*) :  ",i); 
for(j«0  ;  j<4  ;  j++)  { 

rd_s(i, j ,DSPNAME,8,sdata) ; 
sdata[8]  «  0; 
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printf  ("y.l5s  " ,  (char*)sdata) ; 

} 

printf ("\n") ; 
for(k=0  ;  k<nc;  k++)  { 
printf ("  "); 

for(j=0  ;  j<4  ;  j++)  { 

printf  ("‘/.IS.  7g  ’',W[i]  [j+4*k]); 

> 

printf ("\n"); 

} 

if(i<NBOARD-l)  printf ("\n”) ; 

} 

return ; 

} 

void  rd(batch,readf lag) 
int  batch, readflag; 

MAT  B; 

int  i .board, proc, mem; 
float  Data[10*Ns]  [2]  ; 
double  translate: 

FILE  *fopen(),  ♦fd; 

varout[l]  =  batch/ 10  +  'O';  fileout[38]  =  batch/ 10  +  '0' 
varout[2]  =  batchXlO  +  'O’;  fileout[39]  =  batchXlO  +  '0' 
if  ((fd=fopen(fileout,"wb"))  ==  NULL) 
printf ("Can’t  open  output  file\n"); 
lseek(fd,0L,0) ; 

board  =  apples [Nappl] .mle.bd; 
proc  =  apples [Nappl] ,mle_pr; 

B.type  =  (long) 1000; 

B.mrows  =  (long)Ns;  B.ncols  *  (long)lO; 

B.namlen  =  (long)4:  B.imaigf  -  (long)l; 
if  (readflag”!)  {  /*  read  corrupted  data  ♦/ 

rd_fdsp(boaurd,proc,IBUF, (10*2*Ns) .Data) ; 

}  else  {  /*  read  corrected  data  ♦/ 

for  (i»0;  i<10;  i++)  { 
mem  =  IBUF  +  i*8*Ns; 

if  ((errboard  =*  apples [Nappl] .beam_bd[i] ) 

ftft  (errproc  ==  apples [Nappl] .beam_pr[i] )) 
rd.f dsp (board, proc, CBUF,(2*Ns) ,tData[i*Ns] [0]) ; 
else 

rd_f dsp (board, proc, mem, (2*Ns) ,ftData[i*Ns] [0] ) ; 
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} 

} 

fwrite(tB,sizeof (MAT) ,l,fd) ; 
f write (varout.sizeof (char) .B.namlen.fd) ; 
for  (i  =  0;  i  <  10*Ns;  i++)  {  /*  write  the  REAL  data  */ 
translate  ~  (double) Data [i] [0] ; 
fwrite(fttranslate,sizeof (double) ,l,fd) ; 

} 

for  (i  =  0;  i  <  10*Ns;  i++)  {  /♦  write  the  IMAG  data  */ 
treinslate  =  (double)Data[i]  [1]  ; 
f write (fttranslate.sizeof (double) ,l,fd) ; 

} 

fclose(fd) ; 

printf("Data  written  to  *Xs\n"  ,f ileout) ; 

} 

void  wd (batch, start) 
int  batch, stairt; 

{ 

double  val; 
long  dummy; 
char  ♦buf; 

short  i,  j,  dataCNs] [2*Ne] ; 
int  count,  board,  proc,  mem; 

FILE  *fopen() ,  ♦fd; 

filein[34]  =  (batch+start)/10  +  ’O'; 
filein[35]  =  (batch+start)y,10  +  ’O’; 

if  ((fd=fopen(f ilein,  "rb"))  ==  NULL) 
printf  ("Cain’t  open  y,s\n"  ,f  ilein)  ; 


f read (ftdummy,sizeof (long) ,l,fd) ;  /*  eat  machine  ID  */ 
fread(Adummy,sizeof (long) , l,fd) ;  /♦  eat  #  rows  ♦/ 
f read (ftdummy,sizeof (long) ,l,fd) ;  /♦  eat  #  cols  ♦/ 
fread(Adummy,sizeof (long) , l,fd) ;  /♦  eat  imag.  data  flag  ♦/ 
fread(ftdummy ,sizeof (long) ,l,fd) ;  /*  var.  name  length  ♦/ 
buf  =  (char  ♦ )calloc (dummy, sizeof (char)) ; 

fread(buf,sizeof (char) , dummy, fd) ;  /♦  eat  var.  name  ♦/ 
/♦  NOW  DATA  POINTS  MAY  BE  READ  IN  ♦/ 


for(i  *  0;  i  <  Ns;  i++)  { 
for(j  »  0;  j  <  2*Ne;  j++)  { 

f read(kval , sizeof (double) , 1 ,f d) ; 
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data[i] [j]  ®  (short  int)val; 


> 

> 

fclose(fd) ; 

board  »  apples [Nappl] .input.bd; 
proc  =  apples [Nappl] .input _pr; 
co\mt  »  (int) (2*Ns*Ne) ; 
mem  ®  IBUF  +  batch*2*count ; 
wr_s (board , proc ,mem, count , data) ; 
j  =  start+batch; 

printf ("Batch  number  Xd  written  to  0zX4x  on  (Xd.Xd) .\n" , 
j , mem, board, proc) ; 

} 

/♦  DSPSTRUCT.H  :  include  file  for  68030  files  ♦/ 

/♦  Terminal  characteristics  ♦/ 

#define  TERMLINES  20 
#define  SBIG  Oxffff 

/♦  Number  of  boards  in  the  system  ♦/ 

#define  NBQARD  4 

/*  Define  the  codes  for  communication  with  the  DSP32C's 
Upper  8-bits  =  code 
Lower  8-bits  =  transaction  number 

♦/ 

#define  GO.N  0x0100 
#define  GO.XMIT  0x0200 
#define  GO.RCV  0x0300 
•define  GO.DONE  0x0400 

/*  Define  miscelleineous  constants  */ 

•define  NULL  0 

•define  SAFENUM  25000  /♦  number  of  times  to  test  for  acknowledge  ♦/ 
/♦  VME  BUS  HARDWARE  REGISTERS  */ 

/♦  Master  Address  Modifier  Register  on  VMEchip  (8  bit  port)  ♦/ 
•define  VME.MAMR  0xFFFE200D 

/♦  VMEbus  address  modifiers  ♦/ 

•define  STD_SPR_DAT  0x3D  /♦  Standard  Supervisor  Data  */ 

•define  STD.SPR.COD  0x3E 

•define  STD.USR.DAT  0x39 

•define  STD.USR.COD  0x3A 
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fdefine  SHORT.SUP 
#define  SHORT.USR 


0z2D 

0x29 


/*  Structure  mimicing  the  PIO  register  configuration  of  the  DSP32C  */ 
struct  dspstruct{ 

unsigned  short  int  PAR,  junkl,  PDR,  junk2,  EMR,  junk3, 

ESR,  PCR,  PIR,  junk4,  PCRh,  PARE,  PDR2; 


/*  Pointers  to  the  PIO  registers  on  the  4  boards,  4  DSP  per  board  */ 
extern  struct  dspstruct  ♦ptrDSPM  [4]  ; 

/*  Pointers  to  the  PIO  registers  on  boards  0,  1,  2,  3  ♦/ 
extern  struct  dspstruct  ♦ptrODSPC4] ,*ptrlDSP[4] ,*ptr2DSP[4] ,*ptr3DSP[4] ; 
/*  Pointers  to  the  PIO  registers  on  boards  0,  1,  2,  3,  DSP  #0  */ 
extern  struct  dspstruct  ♦ptrODSPO,  ♦ptrlDSPO,  *ptr2DSP0,  *ptr3DSP0; 


/*  Globals  holding  the  latest  contents  of  read-only  registers 
DSPCR  -  Switch/LED  control  registers  on  the  4  boards 
lOCR  -  10  board  control  register 

*/ 

extern  unsigned  short  int  DSPCR[4] ,  lOCR; 

/♦  Base  Address  of  the  Valley  boards  ♦/ 

#define  base  OxffffOOOO 


/*  Addresses  of  the  4  boards  */ 


fdefine  boardO 
fdefine  boardl 
fdefine  board2 
fdefine  boards 


(base+0x0000400) 

(base+0x0000800) 

(base+0x0001800) 

(base+0x0003400) 


/*  10  board  control/status  register  addresses  */ 
fdefine  lOCONREG  (base+OxOOOCOOO) 
fdefine  lOSTATREG  (base+0x000C002) 
fdefine  lOOUTFIFO  (base+0x000C004) 
fdefine  lOINFIFO  (base+0x000C006) 

/♦  Switch  Register  DSP32C  address  for  DSP  boards  */ 
fdefine  DSPSWREGh  Oxa 
fdefine  DSPSWREGl  0x8000 

fdefine  DSPFIFOh  Oxa 
fdefine  DSPFIFOl  OxcOOO 
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/*  Define  bit  field  codes  for  the  PCR  register  in  the  DSP-32C  ♦/ 

♦define  PCR.RUN  0x01 

♦define  PCR.REG  0x02 

♦define  PCR.ENI  0x04 

♦define  PCR.DMA  0x08 

♦define  PCR.AUTO  0x10 

♦define  PCR.PDF  0x20 

♦define  PCR.PIF  0x40 

♦define  PCR.DMA32  0x100 

♦define  PCR.PIOIS  0x200 

♦define  PCR.FLG  0x400 

/♦  Define  bit  field  codes  for  the  I/O  board  ♦/ 

♦define  lO.BrdO  OxOEOO 
♦define  lO.Brdl  OxODOO 
♦define  I0_Brd2  OxOBOO 
♦define  I0.Brd3  0x0700 

♦define  lO.FifoEn  0x4 
♦define  lO.LedOn  0x8000 

/*  Define  Switch  Register  bit  field  codes  for  the  VE-32C  boards  */ 

♦define  SW.InO  0x00 

♦define  SW.Inl  0x01 

♦define  SW.In2  0x02 

♦define  SW_In3  0x03 

♦define  SW.InDis  0x04 

♦define  SW.OutO  0x00 

♦define  SW.Outl  0x08 

♦define  SW_0ut2  0x10 

♦define  SW_0ut3  0x18 

♦define  SW.OutEn  0x20 

♦define  SW.LED  0x80 

♦define  SW_IRq2MT  0x40 


/*  Define  the  communication  buffer  area  DSP32C  addresses  ♦/ 
♦define  COMBUFh  Oxff 
/*  Shorts  ♦/ 

♦define  COMBUFO  OxffffeO 
♦define  COMBUFl  0xffffe2 
♦define  C0MBUF2  0xffffe4 
♦define  C0MBUF3  OxffffeS 
♦define  C0MBUF4  0xffffe8 
♦define  C0MBUF5  Oxffffea 
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#defiiie  C0MBUF6  Oxffffec 
♦define  C0MBUF7  Oxffffee 
/♦  Floats  */ 

♦define  COMBUFOf  OxfffffO 
♦define  COMBUFlf  0xfffff4 
♦define  C0MBUF2f  OxfffffS 
♦define  C0MBUF3f  Oxfffffc 


/♦  Memory  buffer  locations  in  DSP32C*s  ♦/ 

♦define  IBUF  0x008000  /♦  Input  buffers  in  external  memory  ♦/ 

♦define  BBUF  0x00b040  /♦  Beam  output  buffers  in  external  memory  */ 

♦define  OBUF  OxOOaOOO  /♦  Syndrome  buffers  in  outmle  ♦/ 

♦define  CBUF  OxOOcOOO  /♦  Corrected  processor  output  in  outmle  ♦/ 

♦define  RBUF  OxOObOOO  /*  Syndrome  cross-correlations  in  outmle  ♦/ 

♦define  LBUF  OxOOdOOO  /*  Likelihoods  and  threshold  in  outmle  */ 

♦define  MBUF  OxOOdSOO  /♦  Average  magnitude  buffer  ♦/ 

♦define  WBUF  OxOOeOOO  /♦  Weights  used  by  outmle  ♦/ 

♦define  FASTBUF  OxOOOcOO  /*  Internal  RAM  buffer  ♦/ 

♦define  FASTBUF2  OxfffSOO  /*  Internal  RAM  buffer  */ 

♦define  DSPNAME  OxOOfffO  /♦  DSP32C  prog,  name  (used  by  download ())♦/ 

/♦  Define  struct  defining  which  application  is  loaded  into  DSP32C*s  */ 
extern  int  Nappl;  /♦  Which  application  was  loaded  last  ♦/ 
struct  appl  { 

char  ♦comment;  /*  explanation  of  the  application  ♦/ 

struct  tabtrams  *xable;  /♦  Table  of  data  transfers  for  runbeam  */ 

int  ntrans;  /♦  Length  of  this  table  */ 

int  input.pack;  /*  number  of  packets  tremsmitted  by  input  */ 

int  input _bd, input _pr;  /♦  board,  proc  location  of  input  program  ♦/ 

int  mle.pack;  /*  number  of  packets  input  by  beam.mle  ♦/ 

int  mle_bd,mle_pr;  /♦  board,  proc  location  of  GLRT  program  ♦/ 

int  nbeam;  /*  number  of  beam  processors  ♦/ 

int  beaun_bd[12] ,beam_pr [12] ;/♦  board,  proc  of  up  to  12  beam  programs*/ 

int  ncsp;  /*  number  of  csp  processors  */ 

int  csp.pack;  /♦  number  of  packets  input  by  each  csp  ♦/ 

int  csp_bd[4] ,csp_pr[4] ;/♦  board,  proc  of  up  to  4  csp  programs  ♦/ 

}; 

extern  struct  appl  apples [] ; 
typedef  struct  { 
long  type; 
long  mrows; 
long  ncols; 
long  imagf; 
long  namlen; 
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}  MAT; 


/*  Define  table  struct  defining  the  data  packet  transfers  */ 
struct  tabtrans  { 

unsigned  short  int  lOboard,  SWboardO,SWboardl ,SWboard2,SWboard3; 
struct  dspstruct  *pPIOtrans; 

struct  dspstruct  *pPI0rcv0,  ♦pPIOrcvl,  ♦pPI0rcv2,  *pPI0rcv3; 
int  transboard; 
char  *co]nment: 

>; 


/♦  Function  type  definitions  ♦/ 

/♦  init.c  */ 

extern  void  init_help() ,init_all() ,io_init() ; 
extern  void  dsp.init () ,dsp_board_init() ; 
extern  void  dsp_proc_init() ; 

/*  util.c  ♦/ 

extern  void  util_help_aux() ; 

extern  int  askyesO  .askynqO  ,askprompt()  ; 

extern  int  askint () , askshort () , askf loat () ; 

/*  rw.c  ♦/ 

extern  void  rw_help() ,rw_help_aux() ,help() ; 

extern  void  readf () ,readfdsp() ,readfdsp_all() ; 

extern  void  readsO ,reads_all() ,readi() ,readi_all() ; 

extern  float  pow2() ,ftoieee() ; 

extern  void  writefdspO .writes () ,writei() ; 

extern  void  rd_s() ,wr_s() ,rd_i() ,wr_i() ; 

extern  void  rd_f () ,wr_f () ,rd_fdsp() ,wr_fdsp() ; 

/♦  tabtrans. c  ♦/ 

extern  void  tt.helpO  .runbeamO  ; 

extern  void  readcomO  .readbeamO  ,readbeain_2dl()  .readcspO  ; 
extern  void  readcsp.allO  .readbeam.mleO  ,f  indapplO  .veightsO  ; 
extern  int  taberrO; 

/♦  la.c  ♦/ 

extern  void  la_help()  ,la()  ,laB()  ,laC()  ,download_all()  .layoutO  ; 
extern  int  dovnloadO; 

/♦  wrta.c  ♦/ 

extern  void  wrta.helpO  ,run()  ,run_all()  ,stop()  ; 

/♦  setpcr.c  ♦/ 
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extern  void  setpcr.helpO ,readPI0() ,readPI02() ; 
extern  void  setPCRO ,setPCR_all() ,setPAR() ,setPAR_all() ; 

#define  Ne  15  /♦  number  of  hydrophones  ♦/ 

#define  Ns  34  I*  Ns  is  the  t  complex  samples  per  hydrophone 

/♦  MACROS. H  */ 

#ifdef  DEBUG 
extern  int  debug () ; 

#define  debugO(s)  printf(s) 

#define  debugl(s,x)  printf(s,x) 

♦define  debug2(s,x,y)  printf (s,x,y) 

♦define  debug3(s,x,y,z)  printf (s,x,y,z) 

♦define  debug4(s,w,x,y,z)  printf (s,w,x,y,z) 

♦else 

♦define  debugO(s) 

♦define  debugl(s,x) 

♦define  debug2(s,x,y) 

♦define  debug3(s,x,y,z) 

♦define  debug4(s,w,x,y,z) 

♦endif 


/♦  SCRN.H  ♦/ 
set_scroll(top .bottom) 

/♦  top  2uid  bottom  of  scrolling  region  ♦/ 
int  top, bottom; 

{ 

if  ((top  <  25)  ftft 

(top  <  bottom)  ftft 
(bottom  <  25)) 

pr intf  (  "\033  [5Cd ;  y,dr " ,  top , bottom)  ; 
else  printf ("\033[;r") ; 

} 

barch(line,startcol,stopcol)  /♦  make  barchart  ♦/ 
int  line,  startcol,  stopcol; 

int  i; 

move.cursor (line, startcol) ; 
inverse.videoO ; 

for(i=startcol  ;  i<  stopcol  ;  i++)  •( 
printf ("X") ; 

} 
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regular.videoO ; 

> 

move.cursor (line , col) 
int  line, col; 

{ 

printf  ("\033[y.d;%dH",line.  col) ; 

> 

inverse_video ( ) 
printf("\033[7m"); 

> 

regular.videoO 

■C 

printf ("\033 [Om") ; 

> 

clear_screen() 

printf  (•'\033  [2  J"); 

} 

erase.lineO 

printf ("\033[K"); 

> 

hone.cursorO 

{ 

printf  (’'\033  [OH"); 

> 

set _vert_axis (rowl , row2 , col) 
int  rowl, row2, col; 

int  i; 

for  (i®rowl;  i  <«  row2  ;  i++) 

aiove_cursor(i,col) ; 
printf ("I"); 

> 

> 
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